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2.  Summary  of  Progress  During  Reporting  Period 

The  primary  direction  of  progress  on  this  project  during  the 
reporting  period  has  been  toward  the  design  of  robust  linear  estimation 
and  control  procedures  for  uncertain  models.  Generally  speaking,  a 
robust  procedure  is  one  which  is  insensitive  (in  terms  of  performance)  to 
small  deviations  from  an  assumed  model.  One  of  the  most  successful  approaches 
to  robust  design  la  a  game  theoretic  one  in  which  a  procedure  is  sought 
to  have  the  best  worst-case  performance  over  a  relevant  class  of  models 
neighboring  the  assumed  (i.e.,  nominal)  model.  Thus,  the  primary  design 
philosophies  in  this  study  have  been  rainimax-meaa-square-error  estimation 
and  minimax  quadratic  control.  Related  game  theoretic  approaches  to 
hierarchical  stochastic  decision  problems  and  to  antijamming  strategies 
have  also  been  considered  during  the  reporting  period.  A  brief  descrip¬ 
tion  of  the  results  obtained  during  the  reporting  period  is  contained 
in  the  following  paragraphs.  More  complete  details  of  these  results  can 
be  found  in  the  publications  listed  at  the  end  of  this  discussion,  copies 
of  which  are  attached  as  appendices  to  this  report. 

The  problem  of  robust  linear  smoothing  of  a  stationary  random 
signal  with  uncertain  spectrum  observed  in  additive  noise  with  uncertain 
spectrum  is  considered  in  [6].  Here,  a  general  solution  to  this  problem 
is  given  for  spectral  uncertainty  classes  of  a  general  type  based  on 


Choquet  capacities.  This  type  of  model  includes  standard  uncertainty 
models  such  as  contaminated  mixtures  as  well  as  several  topological 
models  of  uncertainty.  Further,  an  extensive  numerical  study  [8]  in¬ 
dicates  that  the  worst-case  performance  of  the  proposed  technique  is 
generally  much  better  than  that  resulting  from  designs  which  simply 
ignore  the  presence  of  uncertainty.  More  general  problems  of  robust 
estimation  of  stationary  signals  (including  smoothing,  filtering  and 
prediction)  also  have  been  considered  in  the  case  of  discrete  time  [9]. 
Here,  a  general  minimax  result  is  given  from  which  robust  solutions 
to  a  variety  of  problems  follow  straightforwardly. 

Also  considered  in  this  study  are  the  problems  of  state  estima¬ 
tion  and  control  in  linear  stochastic  systems  with  uncertain  noise 
statistics.  In  [7],  two  aspects  of  minimax  MSE  state  estimation  are 
considered.  These  are:  rninimax  state  estimation  for  single-variable 
systems  with  uncertain  process  or  observation  noise  spectra  and  rninimax 
state  estimation  for  multivariable  systems  with  white  noises  of  uncer¬ 
tain  componentwise  correlation.  In  each  problem,  a  rninimax  theorem  is 
proven  indicating  that  the  robust  state  estimator  is  the  minimum  MSE 
filter  for  a  least -favorable  model.  Thus,  for  example,  in  the  second 
problem  the  robust  solution  is  the  Kalman-Bucy  filter  for  a  least-favor¬ 
able  pair  of  process  and  state  noise  covariance  matrices.  The  related 
problem  of  rninimax  estimation  with  nonhomogeneous  Poisson  observations 
processes  with  uncertain  rate  functions  has  also  been  considered  [3]. 
Here,  analogies  with  the  continuous-observations  case  are  exploited  to 
obtain  straightforward  rninimax  solutions  to  this  problem.  Minimax 
linear-quadratic  control  within  the  second  formulation  mentioned  above 
(i.e.,  white  noises  with  uncertain  componentwise  correlation)  is  con¬ 
sidered  in  [4,3].  Results  similar  to  those  for  state  estimation  in  [7] 
are  found  to  hold  for  this  problem.  Also,  an  interesting  result  con¬ 
cerning  the  seperability  of  estimation  and  control  is  observed  in  this 
problem.  In  particular,  it  is  seen  that  the  rninimax  controller  design 
is  independent  of  that  of  the  state  observer,  whereas  the  reverse  is 
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not  necessarily  true.  Thus,  for  example,  the  state  observer  for  mini- 
max  control  is  different  from  the  minimax  state  estimator  of  [7]  for 
the  open-loop  system.  This  is  in  contrast  to  the  analogous  problem 
without  uncertainty  in  which  estimation  and  control  objectives  separate. 

As  noted  above,  game  theoretic  analyses  have  also  been  applied 
to  the  problems  of  hierarchical  stochastic  decision  making  and  anti¬ 
jamming.  In  particular,  [1]  considers  the  class  of  stochastic  multi- 
person,  multicriteria  decision  problems  (defined  on  general  Hilbert 
spaces)  with  quadratic  objective  functionals,  static  information  struc¬ 
ture,  and  with  a  hierarchical  structure  with  regard  to  the  order  in 
which  desicions  are  announced.  Here,  a  set  of  conditions  is  obtained 
under  which  a  unique  equilibrium  solution  exists  and  can  be  determined 
as  the  limit  of  an  infinite  sequence.  Further,  [2]  considers  the  pro¬ 
blem  of  transmitting  a  sequence  of  independent  and  identically  distrib¬ 
uted  Gaussian  random  variables  through  a  memoryless  Gaussian  wiretap 
channel  with  an  intelligent  janmer.  Under  a  minimax  MSE  criterion,  the 
complete  set  of  strategies  for  the  jammer  and  transmitter  is  obtained 
for  this  problem  within  power  constrainat  on  transmitter  and  jananer. 

A  variety  of  solutions  is  possible  depending  on  the  relative  power 
constraints  of  the  players  and  the  noise  levels  in  the  transmission 
and  wiretap  channels. 
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4.  Appendix:  Copies  of  Publications 

This  appendix  contains  copies  of  the  publications  listed  in  Section  3 


above,  with  the  exception  of  [5]  which  is  currently  being  put  into  final  form. 
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T.  Basar,  "Hierarchical  Equilibrium  Solutions  in  Stochastic  Decision 
Problems  Defined  on  General  Hilbert  Spaces,"  submitted  for 
publication  to  Information  and  Control. 
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University  of  Illinois 
1101  W.  Springfield  Av. 
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ABSTRACT: 

This  paper  considers  the  class  of  stochastic  multi-person  multi- 
criteria  decision  problems  defined  on  general  Hilbert  spaces,  with  quadratic 
objective  functionals,  static  information  structure,  and  with  the  mode  of 
decision  making  requiring  the  decision  makers  to  announce  their  strategies 
in  a  sequential  order.  A  set  of  conditions,  independent  of  the  probabil¬ 
istic  structure  of  the  problem,  is  obtained,  under  which  the  hierarchical 
equilibrium  solution  exists,  is  unique  and  can  be  determined  as  the  con¬ 
vergent  limit  of  an  infinite  sequence.  The  analysis  is  confined  primarily 
to  the  three-person  case,  in  which  context  explicit  conditions  and  strategies 
are  obtained,  but  extensions  to  the  case  of  more  than  three  decision  makers 
are  also  elucidated. 


★ 

Manuscript  prepared  June  10,  1981. 


1.  INTRODUCTION 


This  paper  introduces  and  discusses  a  general  approach  towards  deriv¬ 
ation  of  the  optimum  (hierarchical  equilibrium)  solution  of  a  class  of 
stochastic  multi-person  multi-criteria  decision  problems  which  incorporate 
multi  levels  of  hierarchy  in  decision  making.  Analysis  is  primarily 
confined  to  three-person  decision  problems  defined  on  general  inner-product 
spaces,  with  quadratic  objective  functionals,  and  with  the  mode  of  decision 
making  requiring  the  decision  makers  to  announce  their  policies  in  a  sequen¬ 
tial  order;  but,  extensions  to  other  types  of  decision  problems  with  more 
than  three  decision  makers  and  other  modes  of  decision  making  are  also 
discussed.  The  stochastic  decision  problems  covered  by  our  general  frame¬ 
work  are  those  wich  finite  dimensional  control  (decision)  spaces,  those 
defined  in  continuous -time,  with  lumped  or  distributed  parameters,  as  well 
as  the  ones  whose  state  dynamics  are  described  by  differential-delay 
equations . 

One  of  the  important  results  obtained  in  the  paper  is  that  under 
suitable  conditions  (which  are  explicitly  obtained),  independent  of  the 
probabilistic  structure  of  the  problem,  the  equilibrium  solution  is  unique 
and  it  can  be  determined  as  the  convergent  limit  of  an  infinite  sequence. 

For  the  special  case  of  Gaussian  distributions  [such  as  the  cases  when  all 
primitive  random  variables  are  Gaussian  vectors  (in  the  finite  dimensional 
case),  or  are  Gaussian  stochastic  processes  (in  the  continuous-time  case)] 
the  optimum  strategies  of  the  decision  makers  are  affine  functions  of  the 
available  static  information. 

IX©  special  versions  of  this  problem  have  been  considered  before  for 
the  two-person  case.  Ba^ar  (1980)  discusses  the  case  when  the  decision 
variables  belong  to  finite  dimensional  spaces,  and  Bagchi  and  Ba^ar  (1981) 


discuss  che  continuous -time  version  when  Che  decision  makers  make  noisy 
observations  of  the  initial  state.  In  a  way,  the  present  paper  presents 
nontrivial  extensions  of  these  results  to  the  M(>  3) -person  case,  but  th»; 
solution  here  is  not  as  explicit  (in  analytic  form)  as  in  those  two  papers 
because  the  framework  here  is  more  general. 

A  precise  mathematical  formulation  of  the  problem  is  presented  in 
Section  2,  and  the  general  solution  is  obtained  in  Section  3.  For  some 
background  material  on  functional  analysis  that  is  employed  in  these  two 
sections,  the  reader  is  referred  to  Balakrishnan  (1976)  and  Kantorovich  and 
Akhilov  (1977).  Section  4  treats  a  special  case,  and  Section  5  discusses 
possible  extensions  to  more  general  models.  The  paper  concludes  with  an 
Appendix. 
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2.  GENERAL  FORMULATION 

Let  (fi,  3,  p)  be  an  underlying  probability  space,  and  (i  »  0,1, 2, 3) 

12  12 

be  separable  Hilbert  spaces  with  inner  products  (h  ,  h  )^,  for  h  ,  h  e  H^. 

Let  be  an  H^-valued  weak  random  variable  [cf.  Balakrishnan  (1976)  1  defined 

on  (n,  3,  p) ,  and  denote  the  sigma-algebra  generated  by  z (i  ■  1,2,3). 

Furthermore,  z^  has  a  finite  second  moment,  i.e. 

(i)  E{|(zt,  h^J2}  <  »  for  every  hi  e  Ht 

2 

(ii)  E{|(z.,  h^)  j  }  is  continuous  in  hi, 

where  E{.}  denotes  the  total  expectation  over  the  underlying  probability  space. 

Let  (i  *  1,2,3)  be  separable  Hilbert  spaces  with  inner  products 
12  12 

< s  ,  s  >i,  for  s  ,  s  e  and  u^  be  an  Sj-valued  weak  random  variable  defined 

on  (n,  3,  0  and  satisfying  the  following  two  properties: 

(i)  u^  is  a^-neasurable, 

(ii)  u^  has  finite  second  moments. 

We  call  such  a  u^  a  permissible  control  (decision)  variable  of  the  i-th 
decision  maker  [abbreviated,  DMi  ].  Equivalently,  we  can  Introduce  a  permis¬ 
sible  strategy  for  DMi  as  a  mapping  y^H^  ■*  sucl1  that  is  a^"®«®surable 

and  has  finite  second  moments.  Denote  the  class  of  all  such  mappings  for  DMi 

by  [  to  be  called  the  strategy  space  of  DMi J,  which  is  in  fact  a  Hilbert 

12  12 

space  under  the  inner  product  Et^^Zj),  y^Zj)^},  for  y^,  y  e  1^.  In 

this  general  formulation  [using  the  standard  terminology  of  decision  theory], 
zq  denotes  the  state  of  Nature  and  z^  (i  ■  1,2,3)  denotes  the  measurement 
available  to  DMi  —  all  these  (weak)  random  variables  are  static,  in  the  sense 
that  they  do  not  depend  on  the  controls  (actions)  of  the  decision  makers. 
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In  order  Co  complete  Che  formulae  ion,  we  introduce  general  quadratic 
objective  (cost)  functionals  for  the  decision  makers  on  the  product  strategy 
spaces  ^  x  r3  as 

Ji/Yl’  y2 *  y3^  *  E{si^2o*  *V  u2 ’  u3^uj  "  Yj^*j^*  -i  “  1»2»3^  ^2 •  1) 

where 


8i(V  V  V  u3}  -  2  <ui»  Vi  -  <ui*  DijVl  -  <ui*  DikVi 
+  7  <ur  Yj  +  K-  4Vk  -  <Y  F}kYj  - <V  cl‘o>i  <2-2> 

“  <uj*  ‘jVj  ~  <uk>  CJVk  ^  1.  j,  k,  -  1,  2,  3,  J  4  k  *  i,  j  <  k. 

Here,  ,  Dik,  F  * ^  ,  F^,  Fjk,  C*,  C*,  c£  are  linear  bounded  operators  defined 
on  appropriate  Hilbert  spaces,  with  Fjj  and  Fkfc  being  also  self-adjoint. 

Our  objective  in  this  paper  is  to  investigate  the  existence,  uniqueness 
and  derivation  of  a  hierarchical  equilibrium  solution  for  this  stochastic 
decision  problem,  in  the  presence  of  a  linear  hierarchy  for  decision  making. 
Specifically,  it  is  assumed  Chat  the  strategies  are  announced  sequentially  ~ 
first  DM1  announces  his  strategy  and  makes  it  known  to  both  DM2  and  DM3,  then 
DM2  announces  his  strategy  and  makes  it  known  to  DM3,  and  finally  DM3  decides 
on  his  optimal  strategy.  Each  DM  strives  to  obtain  a  minimum  value  for  his 
cost  functional,  thereby  leading  to  the  following  definition  of  a  hierarchical 
equilibrium  solution: 

Definition  2.1.  The  set  of  strategies  {y°  e  i  ■  1,  2,  3}  is  in  hierarchical 
equilibrium  if 


(i)  jl(y J,  t2(y°),  t3[yJ,  t2(y°)1)  <  J1(Y1,  T2(Yl),  t3Iy1,t2(y1)1) 


v  Yi  C  ri 
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(ti)  Y°  -  T2(Y°) 

(iii)  >°  -  T3(y°,  y°) 

where  T2:  -*  T, ,  *  r2  ■*  are  unique  measurable  mappings  satisfying 

the  Inequalities  J2(y^,  T2^Y1^'  ^3^1’  ^2^Y1^  ^  ^2^1*  Y2 ’  *3^Y1,Y2^’ 

W  >1  e  rx,  y2  e  f2 , and  J^y^,  Yj.  t3^i»  Y,))  <  J3^Y1’  Y2*  Y3^ '  W  Yi  £  ri* 

1  -  1,  2,  3.  3 

Remark  2.1.  The  assumption  of  uniqueness  of  T2  and  In  Def.  2.1  is  not 
restrictive  here  since  the  underlying  information  structure  is  static,  and 
such  unique  mappings  exist,  as  we  shall  demonstrate  later.  If  the  information 
structure  had  been  dynamic,  however,  we  would  have  to  extend  the  definition 
in  order  to  account  for  nonunique  responses  [see  Ba^ar  (1981a, b),  for  such 
an  extension]  .  0 
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3.  DERIVATION  OF  THE  EQUILIBRIUM  SOLUTION 

Step  I.  Firstly,  to  determine  Tj,  we  hold  Y ^  c  r,  end  Y2  e  *r2  fixed  end 
minimize  J^y^,  Yj#  *)  over  ^3*  Since  g3  i*  strictly  convex  in  u^»  tl'i* 
ainimizet ion  problem  edmits  Che  unique  solution 


Y3(z3)  “  T3(y1*y2)(z3)  -  ♦  D32y2(*2)  +  C3*o'*3J 


(3.1) 


where  E{-|z^}  denotes  the  conditions!  expected  velue  of  (•)  given  the  observed 
velue  of  z^.  This  is  e  well-defined  quantity,  which  is  o3 -measurable,  by 
standerd  results  of  probability  theory  [  cf .  Lolve  (1963)  ]. 

Step  2.  Next,  to  determine  T^,  we  substitute  (3.1)  into  J2(y^,  Y2*  Y3^  *nd 
seek,  to  minimize  Che  resulting  expression  over  Y2  e  For  Y^  e  r^; 

this,  however,  is  not  e  standard  stochastic  optimization  problem,  because  of 
the  presence  of  several  conditional  expectations.  Writing  out  the  function 
to  be  minimized,  we  have 

J2^y1’  y2 *  T3^y1’  y2^  “  E^T<u2’  u2?2  ~  <u2'  D21,U1>2 

-  <u2,  D23D31Etu1|z.]  +  D23D32Etu2[z3]  +  IfccJ*  ZQ  >, 

+  T^l*  F11U1>1  'f  5<D31Elullz31  +  D32Elu2lZ31+  C3  Elzolz31,  F33D31Ef  ullZ3J 

+  F33D32E[u2lZ31  +  r33CS*zolz3l>3  -  <ul*  F13D3lE{ul!Z3J  +  AiW “2 lZ3> 

+  Fl3C3E(zo|z3l>1-  <u2,  C2zq>2  -  <ur  C1z<j>1  -  <D31Elu1|z3l 

♦  D32E[u2|z:]  +  C*E[zo|z3],  C3*o  >3  |ut  -  y^).  i  -  1,  2  } 

■  L2(Y?)  ,  for  fixed  Y3  6  r1  • 

We  now  sesk  a  Vj  t  so  that,  for  fixed  Y^  e  r^, 


9  . 
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sin  .»a*au»msMM*mew«mm«-'  - 
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1*2 V  Y2  £  r2*  (3.2) 

Sine*  r2  is  a  linear  vector  space,  to  avary  y 2  e  r2  there  corresponds  an 
h  £  Tj  such  that  y,  ■  y,  +  h.  Hence,  (3.2)  can  be  written  as 

Lj  ( Yj  ^  ^  ^2^2  +  ^  •  V  h  e  , 

and  furthermore,  since  I<2  is  quadratic,  we  have 

l2(?2  +  h)  -  L2(?^+  a.2(?2;h)  +  j62L2(?2;h), 

whereby 

L2(^)  <  l2(y2>  +  6L2(Y2;h)  +  i<S2L2(f2;h),  V  h  e  r2  .  (3.3) 

2 

Here,  6L2  and  6  L2  denote,  respectively,  the  first  and  second  Gateaux  variations 
of  L2  around  ^2>  which  are  defined  by 

5L.(-y  ;h)  -  lim  L,(?~  +  ah) 

22  a  -  0  da  2  2 

2  d2 

«  ,;h)  -  lim  L_«.  +  ah)  . 

22  a  -  0  **  22 

The  first  of  these  is  homogeneous  of  degree  one,  and  therefore  (3.3)  readily 
leads  to  the  set  of  necessary  and  sufficient  conditions 

5L2(y2;h)  -  0,  52L2(?2;h)  >  0  V  h  e  ^  •  (3.4) 

By  making  use  of  the  smoothing  property  of  conditional  expectations,  we  can 
2 

write  6L2  and  S  la  the  forms 

®*2 ( ^2 » h)  -  E{<h(z2),X2(s2,u2,ul)>2  [u2  -  y^Zj),  u2  •  y2(z2)}  (3.5a) 

and 


62L2(t2;h) 


E{<h(z2),  h(z2)>2  +  <E[h(z2)  jz3], 


V 


(D32F33D32  *  D23D32  *  D32D23*  h(*2) I *3  1>2  } 


(3.5b) 


where 


12(Z2,U2,U1)  u2  +  ^D32F33D32  ~  D23D32  "  D32D231  EfE^u2lz3^ lz2^ 

-  °2tE1“llI2l  *  "4F33D31  -  D23D31  -  ^ 3Ie1eI  “l  ^ 1«21 

-  lD23C3  +  D32C3  -  D32r33C3lElEll.|l31|l2l  *  i*t*„l*2I  • 

* 

and  super-index  scar  (  )  denotes  Che  adjoint  operator  under  the  appropriate 
inner  product.  Relations  (3.4)  and  (3.5a)  now  readily  lead  to  the  first- 
order  condition 


^(Zj*  u2,  Uj)  a  0  ,  a.e.  Pg  ,  u2  -  Y2(z2)  , 

where  P  is  the  probability  measure  Induced  under  the  weak  random  variable 
Z2 

z2.  Let  us  rewrite  this  equality  more  explicitly  as 

W  "  K  E  tE[  r2(z2)  |z3  1 1*2  1  -l-  D2lE  ^  YL  (*L)  1*2  ^ 

+  GXE  CE  Iy1(*1)  |«3  1(*2  1  +  C2E[zo|z21  +  G2E[E(zo[z3J|z2  1(3.6) 

where 


*D32F33D32  +  D23D32  +  D32D23 

*D32F33D31  +  D23D31  +  D32F13 
'D32F33C3  *  °23C3  *  °32C3 


(3.7a) 


(3.7b) 


(3.7c) 


^•^*c^on  (3.6)  is  in  fact  a  linear  operator  equation  which  has  to  be  solved 
for  Y2.  and  this  determines  the  mapping  T2  introduced  in  Def.  2.2.  The 
questions  of  existence  and  uniqueness  of  such  a  mapping,  and  satisfaction 
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of  che  second  condition  of  (3.4),  are  now  addressed  Co  in  Proposition 
3.1  Co  follow. 


Preliminary  terminology  and  notation 

Let  ^i  be  the  space  of  linear  bounded  operators  mapping  into 
itself.  For  B  e  let  (B)  and  | | Bj  J  A  denote,  respectively,  the 
spectrum  and  norm  of  B,  where  the  latter  is  defined  by 


|  J  B J  J  ±  -  sup  [^BjX,  Bj^Oj/oc,  x>t]' 
x  e  Fj 


(3.8) 


Furthermore,  let  r^(B)  denote  the  spectral  radius  of  B  which 


is  defined  by 


r.(B)  -  lim  sup  | |B  | |. 

n  -  « 


(3.9) 


Finally,  let  us  introduce  the  Hilbert  space  as  the  space  of  all  S^valued 
Cj -measurable  weak  random  variables  defined  on  (H,  3,  P)  and  with  finite 
second  moments  (i,  j  ■  1,  2,  3).  fNote  that  is  in  fact  isomorphic  to 
rt,  and  it  is  also  a  Hilbert  space.] 

Now,  by  invoking  the  following  condition 


Cl.  r_ (K)  <  1, 


we  are  in  a  position  to  state  the  proposition  given  below. 


Proposition  3.1.  Assume  condition  Cl  to  hold  true,  i)  Equation  (3.6) 

mm 

admits  a  unique  solution  Yj  «  ^  *or  every  e  which  is  also 

the  unique  solution  satisfying  (3.4);  hence  -  T2  is  uniquely  defined, 

ii)  The  unique  solution  to  (3.6)  can  be  found,  for  any  fixed  e  as 


V. 
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the  convergent  Limit  of  an  iterative  procedure  (known  as  successive  approx¬ 
imations),  which  starts  with  an  arbitrary  element  of  on  the  right-hand 
side  (RHS)  of  (3.6)  and  recursively  updates  this  choice  by  resubstituting 
(to  the  RHS)  the  strategy  obtained  on  the  left-hand  side  of  (3.6). 

ffroof .  Let  us  rewrite  (3.6)  as 

Y2(z2)  -  K  <S2i2^2(z2)  +  ky  (*2)  (3.10) 

where  ky  is  a  fixed  element  of  T2  for  every  fixed  Y^  e  and  <$2  denotes 

the  conditional  expectation  operator  ^2^2 "  ErY2 (z2) | z3},  where  the 

convention  is  such  that  the  super-index  stands  for  the  conditioning  sigma- 

field  and  the  sub-index  identifies  the  range  space  [S^,  S2  or  S^]  of  the 

weak  random  variable  whose  conditional  expectation  is  taken.  The  operator 
-.2 

o 2  is  likewise  defined  as  a  conditional  expectation  operator.  Each  of  these 

operators  is  a  pro  lection  operator  [see  e.g.  Ba^ar  (1975)],  with  <3*  mapping 

M2  into  M*,  where  M2  is  the  Hilbert  space  of  all  second-order  Sj-valued 

weak  random  variables  defined  on  (n,  3,  P).  [Note  that  ^  and  Mj  3  M^.] 

2  3 

Being  projection  operators,  both  $2  and  have  unit  norm,  are  linear  and 

2  3  2 

bounded.  Therefore  KfS^j :  ■»  is  a  linear  and  bounded  operator.  We 

may  also  take  the  range  spaces  of  ^’^2  an<*  K  as  ^  (*-a<stead  of  M^)  and 
Introduce  a  natural  extension  of  r2  to  the  space  of  linear  bounded  operators 
mapping  into  itself,  to  be  denoted  r2>  Then  we  have,  using  the  spectral 
radius  inequality  for  product  operators, 

_  2  3  _  _  3  _  , 

*  r2(K)r2(£2)r2((jJ) 

«  r2(K)  -  r2(K)  <  l,  (i) 

where  the  second  inequality  follows  from  a  known  (unit  norm)  property  of 


V 


projection  operators,  the  equality  follows  since  (by  construction)  the 
restriction  of  to  is  r^,  and  the  last  inequality  follows  from  Cl,. 

Hence,  the  spectral  radius  of  the  operator  in  (3.10)  is  less  than 
unity  [or,  equivalently,  the  spectrum  of  that  operator  is  totally  in  the 
unit  sphere],  and  by  Theorem  3  (Chapter  XIII)  of  Kantorovich  and  Akhilov 
(1977)  equation  (3.10)  admits  a  unique  solution  in  M2*  Furthermore,  by  the 
same  theorem,  the  solution  can  be  obtained  iteratively  by 

Y2(n+l)(z2)  -  K42i^2(n)(z2)  +  ky  (s2)  ,  n  -  0,  1,  .  .  . 

where  Y2°^(*)  is  any  initial  choice  in  Since  we  already  know  that  the  range 

*2  3 

space  of  K<£2<?2  comprises  only  cr2 -measurable  elements,  and  ky  e  it 

follows  that  Y.(-)  ■  limY^n^(-)  is  necessarily  a_ -measurable,  and  therefore 
-n-“  2 

the  unique  solution  Y2(z2)  of  (3.10)  in  is  in  fact  in  the  subspace  M^, 

and  to  this,  there  corresponds  a  unique  element  Y2  in  r2- 

What  remains  to  be  shown  now,  in  order  to  complete  the  proof  of  the 

Proposition,  is  satisfaction  of  the  second  order  (sufficiency)  condition 

of  (3.4).  Towards  this  end,  we  first  note  that  K  is  a  self-adjoint  operator, 

and  hence  under  Cl, 

|<x,  Kx>i|  <  <x,  x5^  ,  V  x  e  ,  x  t  8  , 

where  9  is  the  zero  element  in  M^.  If  this  inequality  is  utilized  in  (3.5b), 
we  readily  arrive  at  the  bound 


6%(Y2;h)  >  E[<h(z2),  h(z2)>2  -  <Eth(z2)|z3],  E[h(z,) |*3]>2]  ,hH 


2  1  3J 


2  1  3J  2J 


By  the  nonexpanslve  property  of  conditional  expectations  [cf.  Ba{ar  (1975)], 


Et<E[h(z2)|z3],  E[h(z2)|z3]>2]  4  Et<h(z2),  h(z2)>2]  , 
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and  therefore 

52L2(Y2;h)  >  E{<E{  h(z2)  |z3]  ,  E(  h(z2)  | Z3]  >2  -  E(h(z2>|z3], 

E(  h(z2)  |z3]  >2)  -  0, 

that  is,  the  second  Gateaux  variation  is  positive  definite.  D 

2  3 

Remark  3.1.  Under  Cl,  the  operator  [  I-K&2&2  ]  is  invertible,  and  its  inverse 
is  also  linear  and  bounded.  This  leads  to  an  operator-form  characterization 
of  T2,  which  is 


t2(Yi)(z2)  -  [i  -  KS2s2r1{(D21a2  +  G1iJa2)Yl(z1) 

+  ^C2Z0  +  &2£2G2V 


(3.11) 


where  the  definition  of  is  analgous  to  that  of  a*  in  the  proof  of  Propo¬ 
sition  3.1.  We  furthermore  note  that,  under  Cl  the  inverse  can  be  written 
in  the  form  of  an  infinite  convergent  series  [cf.  Kantorovich  and  Akhilov 
(1977)  I 


1 1  -  Ka Is?) _1  -  i  +  i(Ka2a2)n 

n-1  * 


(3.12) 

which  we  will  have  occasion  to  utilize  in  the  sequel,  at  step  3  of  the 


derivation.  u 

Step  3.  We  have  so  far  determined  T3  and  T2,  uniquely,  and  under  conditions 
which  do  not  depend  on  the  probabilistic  structure  of  the  problem.  In  order 
to  complete  the  derivation  of  the  equilibrium  solution,  we  now  consider 
the  minimization  problem  described  by  (i)  in  Def.  2.1,  so  as  to  determine  T^. 
Towards  this  end, let  us  first  substitute  the  unique  responses  of  DM3 


i 

* 


and  DM2,  as  given  by  (3.1)  and  (3.11),  respectively,  into  J^,  and  consider 
the  first  Gateaux  variation  of  the  resulting  quadratic  expression  around 
a  nominal  point  ^ 
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Ji(V  W’  Vv  t2(yi)J)  *  - 

<V<®,2  -  °l3°32^>«  -  MJ82,"ll(B21,i  *  Gl*^>n  *  4Cl‘o 

*  ^^022o]  y  Dn[DJiaGy1  ♦  «$«„]  *  c|z0>1  1 

t  ±<(I  -  KS^)-l((B21^  y  *  ^C3zo  +  «^V01. 

<r22  -  2F33DJ2^)(I  -  Ksji^r1!  <d21«5  ♦  g1s25{)y1  +  ^c*z0  *  a|a|G2*0l 

-  2F23‘ °31S?T1  *  ‘ft'o1  -  2ClV2  +l<D3l‘!’'l  +  SJCk 

*  D32S2(I  *  Ka2S2)'11  <021S?  *  GiaiSX  *  52Ck  +  *2^V1- 
F3j(°3lX  +  aiC32,'  +  r33D32S2(I  ‘  “ 2 <D2iaI  +  G1SX',1  *  S2C2zc 

*  fi2a2G2Zo  ~  2C3zo>3  "  L1(V  *  <3.13) 


5Ll(Vhl)  *  E{<h1(z1),  ll(z1, 


(3.14) 


where 

Zl(zl’  Yl(zl))  •«^Yi<*i)  *  (D12a2  +  S&2a2)k3(z2)  "  kl(2l> 

-  +  aiaiG*)  (I  -  K&^)-1{k2(z2)  -CF22-G4&2a2)k3(z2)}  (3'15) 


and 


»  •  I  -  *1*1*1  -  <012*2  +  G3*2*2,a  -  KS2S2rl(tl21*l  +  Giai*l> 
-  *2<D2l*5  *  °ia!a?,*(I  -  “2a2)_l  '<04*2*2  '  F22> 

.«  -  *  »J2«?  +  C^4!1 

A1  "  D13D31  +  D3 1D 13  *  D31F33D31 


(3.16a) 

(3.16b) 
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<3  " 

D13 

°32  + 

*  1*  *  1 
D31F23  "  D31F33D: 

32 

(3.16c) 

G4  ■ 

F23 

°32  + 

* 

*  1  *  1 
D32F23  "  D32F33D: 

32 

(3. 16d) 

kl(zl) 

A 

m 

C3E[i 

5olZll  +  t°13c33  + 

*  1 
°31C3  - 

0*3ir33C31E(Elsoll3l 

K1 

(3.17a) 

k2(z2) 

A 

m 

C2E[: 

colz2]  “  (D32C3  + 

F23C3  - 

44C3)E'E"oh> 

lZ2J 

(3.17b) 

k3(z2) 

A 

m 

(I  - 

K^)"^C2zo  + 

o'  ' 

(3.17c) 

This  result  is  obtained  through  some  routine  but  cumbersome  manipu¬ 
lations,  and  in  the  course  of  the  derivation  we  also  obtain  the  second 
Gateaux  variation  to  be 

52L1('Y1>  hx)  ■  E{<h1(z1),  +  <^(2^,  (I  -  ^)h1(z1)  >1}  . 

(3.18) 

Mote  that  both  and  are  self-adjoint  operators  land  so  are 
2  3 

and  G^^S^,  and  hence  the  operator  It  is  self-adjoint,  mapping  into 
itself. 

The  counterpart  of  inequality  (3.3)  is  also  valid  here,  and  by  arguing 
as  in  step  2,  we  arrive  at  the  set  of  necessary  and  sufficient  optimality 
conditions 

6L1(V  hl}  *  °*  42W  V  >  0  V  hi  £  ri.  (3.19) 

which  readily  leads  to  the  necessary  condition  [see  (3.14)1 


i 


j  -**•  ■'*#.*/  m  atr  .•-»  4^4 
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2-1  (2=1 ,  ■  0  »  a.e.  pz 


(3.20) 


where  P  is  che  probability  measure  induced  under  the  weak  random  variable 
Z1 

z^.  Let  us  rewrite  (3.20)  in  a  more  appealing  form,  which  is  the  counterpart 
of  (3.6)  for  DM1: 


?  Uj)  ■  (I  --5)y1(z1)  +•  k^z^  +  (d^^2  +  G3&2a2')k3^z2^ 


(3.21) 


+  aJ(aiD21  +ai^Gi)(I  -  Ka2^)-1{k2(z2)  -  (F22  -  G4&2&2)k3(z2)} 


This  is  a  linear  equation  in  y^,  and  we  now  seek  conditions,  independent 
of  the  probabilistic  structure  of  the  problem,  under  which  it  will  admit  a 
unique  solution  in  r«.  The  general  existence  condition,  which  also  enables 
us  to  compute  the  solution  recursively,  as  in  Proposition  1,  is 


f 


rL(I  -  £r)  <  1  (3.22) 

where  r^(*)  defined  by  (3.9).  However,  this  is  dependent  on  the  statis¬ 
tics  of  the  weak  random  variables  involved,  and  therefore  is  not  precisely 
the  condition  that  we  seek.  By  appropriate  manipulations  on  (3.22)  [details 
of  which  are  included  in  the  Appendix] ,  we  are  able  to  obtain  a  "deterministic" 
condition  [C2,  given  below]  which  insures  satisfaction  of  (3.22).  Before 
presenting  this  condition,  and  the  general  result  as  Theorem  3.1  below,  let 
us  introduce  the  following  expressions: 


Preliminary  definitions  for  Cond  it  ion  _C2_  and  Theorem  3.2 

,<">  -  t1(0u<K)"01  -  -  4«*>nlF]2°21  -  0*jl 


*  .  *  n-1  *  n-1 

+  d21(k  )n  g4g1  +  d21g4(k2)  d21  (3.: 

*  n“^  n_^  *  i  1  1  *  n~^  n-^  *, i  1  , 

-  D,.  r  l  (K  )1F,,(K)JD-.  -  D„  l  Z  (K  )  G, (K)JD..] 

21  i-1  j-1  22  21  21  i-1  J-l  “  21 

i+j -n  i+j -n-1 


(3.23a) 


£ 

« 

i 

t 

5 


i 
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a$n)  -  r1[D12(K)nG1  -  D^F^OC)1^  -  D*t  (K*)11!  F^Gj,  -  C31 


it  it  n  — 1  *  ★  n  —  1 

+  d21(k  )n  1g,d21  +  D21GV'K  )  \ 


(3.23b) 


*  n_^  n-^  *  i  I  i  *  n-^  n-^  *  i  1  , 

D  z  :  (k  )  ?;n(K)Jc  +  D  z  :  (k  )1g,(k,)jg.1 
-L  1-1  j-i  "  "  l-i  j-i 


i+j  -n 


i+j»n-l 


i^n)  -  r1[G3(K)nD21  -  G*F22(K)nD21  -  G*(K*)"t  F*^  -  D*,] 


+  G*(G4(K)n_1D21  +  (K*)n""lG^G1) 
a-1  n-1 


(3.23c) 


n-1  n-1 


^  i4— i-  u— x  *  •  1  it  ^  it  \  i 

-  G,  Z  Z  (K  ) LF,,(K)^D-1  +  G,  Z  Z  (K  )  G,(K)JD 
1  i-1  j-1  22  21  1  1-1  j-1  4  21 


i+j-n 


i+j -n-1 


^n)  -  r2[  G3(K)nG1  -  G*F22(K)nGx  -  G*(K*)”(  ?J2G1  -  G3I 


*  n-1 

G1G4(K)n  Gl 


.  .  •  jl  1  n— 1  x  j  i  . 

+  G.(K  )n'1G,G.  -  G.  Z  Z  (K  )  F—OO^G. 
1  '  4  1  1  1-1  j-1  2"  1 

i+j-n 


(3.23d) 


*  n-1  n-1  *  4  i  , 

+  G  Z  Z  (K  )JG,(K)-1G  ] 

i-1  j-1 
i+j -n-1 


r  -  r1(A1)  +  r1(D12D21  +  D21Z>12  -  D^F^D^) 


+  2r1(D12G1  -  D21[F22G1  -  G^  )  +  ^(G^  +  GjG3  -  GjF^Gj.) 


+  2r1(D21G4G1)  +  r1(D21G4D21)  +  r^G^Gj) 


.  ^r-(n)  ,  (n)  ,  (n)  (n), 

+  ^  lax  +  a->  +  a3  +  a4  ' 

n-1 


(3.24) 


Condition  C2 


r  <  1 


Theorem  3. 1  Assume  conditions  £1  and  C2  to  hold  true. 

o 

i)  Equation  (3.21)  admits  a  unique  solution  e  r^,  which  is  also  the 
unique  solution  satisfying  (3.19);  hence,  the  decision  problem  admits  a 
unique  hierarchical  equilibrium  solution,  given  as  fy°,  y°  »  Tj (y°) , 

Y 3  *  T3(Yl,  7 2 )  J • 

ii)  The  unique  equilibrium  strategy  of  DMl  can  be  obtained  as  the 
convergent  limit  of  an  iterative  procedure  applied  to  (3.19),  which  is 
similar  to  the  one  of  Proposition  3.1  (ii). 

Proof.  See  the  Appendix  (Section  6).  The  iteration  is  defined  explicitly 
by  (6.1).  = 

Remark  3.2.  The  operator  £,  which  plays  a  crucial  role  in  the  solution 

2  3-1 

of  (3.21),  depends  on  some  inverse  operators,  specifically  on  (1  - 

and  its  adjoint.  However,  under  £1  each  of  these  operators  can  be  determined 

as  the  convergent  limit  of  an  infinite  series  [cf.  Remark  3.1,  relation 

2  3 

(3.12)],  and  moreover,  since  r_(K£0£.)  <  1,  a  finite  truncation  of  this 

/  L 

2  3-1 

series  will  provide  a  fairly  good  approximation  to  (I  -  •  Hence> 

the  linear  bounded  operator  can  be  written  in  the  form  of  infinite  sums 
of  products  of  "deterministic"  linear  bounded  operators  and  some  conditional 
expectations  [to  see  this,  simply  substitute  (.3.12)  into  (3.16a)],  and  for 
practical  purposes,  in  the  form  of  finite  sums  of  products  of  such  operators. 
Therefore,  in  principle,  the  iterative  procedure  of  Theorem  3.1  (ii)  can 
be  carried  out  routinely,  by  performing  a  series  of  linear  operations. 

This  discussion  also  readily  leads  to  the  following  structural  result 
(given  in  Corollary  3.1  below)  in  the  case  of  Gaussian  distributions.  3 

Corollary  3.1.  If  the  weak  random  variables  z^,  z^,  Z2  and  z^  are  jointly 
Gaussian  distributed,  the  unique  hierarchical  equilibrium  solution  is  affine 
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in  che  available  information;  that  is,  there  exist  linear  bounded  operators 

(i  =  L,  2,  3)  and  "deterministic"  functions  s^  e  S^  (i  *  1,  2,  3), 

such  that 


o ,  \ 

Yi<zi> 


X.  z.  +  s . 

ii  i 


(i  *  1,  2,  3) 


(3.25) 


Proof.  In  view  of  the  discussion  of  Remark  3.2,  this  result  follows  from 

che  following  two  properties  of  Gaussian  weak  random  variables  [cf.  Balakrishnan 

(1976)]. 

1  2 

(a)  If  z.  and  z.  are  two  Gaussian  weak  random  variables  which  are 

l  i 

H^-vaLued  and  cr  ^-measurable,  and  A^  and  are  two  "deterministic"  linear 

bounded  operators  with  che  same  range  space,  mapping  o^-measurable  H^-valued 

1  2 

variables  into  c^-measurable  variables,  then  the  sum  A^z^  +  A^zi  is  also 
a  Gaussian  weak  random  variable. 

(b)  If  z^  and  z^  are  two  Gaussian  weak  random  variables,  with  the 

latter  being  H. -valued,  we  have  E[z.  Tz.]  *  A.  ,z.  +  a . . ,  for  some  linear 
J  J  ij  J  ij 

bounded  operator  A. .  and  for  some  a. ,  g  H.. 

ij  ij  J 

Repeated  utilization  of  these  two  properties  on  the  right-hand  side 
of  (3.21),  for  a  fixed  y^(z^)  taken  as  an  affine  function  of  z^,  leads  to 
an  affine  function  of  z^  on  the  left-hand  side  of  (3.21).  Since  the  iteration 
converges  for  any  starting  choice  of  y^,  the  statement  of  the  Corollary 
follows  as  a  special  case  of  Theorem  3.1.  C 


4.  A  SPECIAL  CASE,  AND  A  NUMERICAL  EXAMPLE 


Condition  C2,  which  involves  some  straightforward  but  rather  cumbersome 
operations  in  terms  of  linear  operators,  simplifies  considerably  for  an 
important  special  class  of  decision  problems.  Specifically,  consider  the 
case  when  each  decision  maker  interacts  (through  his  cost  function)  only 
with  the  closest  decision  maker(s)  in  the  hierarchy.  In  such  a  case,  J , 
will  not  depend  on  an<*  ^3  wiH  not  depend  on  y^,  but  will  in  general 
depend  on  both  and  T^.  Therefore,  we  now  consider  the  decision  problem 
described  by  the  cost  functionals  (2.1)  -  (2.2),  but  with  the  terms 
corresponding  to  the  operators 


D  F  F 
u13’  *33*  *23 ’ 


and  C 


3 

1 


deleted.  Furthermore,  we  assume  that  there  is  no  coupling  between  the 
decision  variables  of  DMl  and  DM3,  in  the  cost  function  of  DM2.  Then, 
the  expression  corresponding  to  (3.24)  becomes 

r  *  r1^12^2l  +  ^21^12  ”  ^21^22^21^  +  F-j-jG,  ) 


'  1 '12  1 


21  22  v 


+  rl(G*F22V  +  tl">2ltF22<lt,“021  + 


n-1  n-L  ^  i  \ 

+  I  2  (K  )  F  (K)JD,.  }]  , 

t-1  j-1  “ 

i+j«n 


(4.1) 


wtyich  can  be  obtained  by  basically  following  the  steps  (of  the  Appendix) 
that  led  to  (3.24).  Hence,  what  replaces  C2  is 

C2'  7’  <  1  , 


under  which  (and  also  under  Cl)  the  statement  of  Theorem  3.1  is  valid  for 
this  special  class  of  decision  problems. 
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As  an  illuscracion,  consider  che  case  when  all  spaces  are  one- 


dimensional 

Euclidean 

,  and 

various 

paramecers  assume 

che  values 

°12 

“  °21  " 

D23 

‘  °32 

1 

■  — 

4 

F1 

2 

_  r*  *“  _ 

p2 

„3 

1 

F22 

F11 

F33 

r  22 

8 

r\ 

.  .1  . 

F1 

-  rl 

3  3  2 

°13 

F33 

F23 

C3 

°31  F11 

F12  C1  F13 

Condicion  Cl  cakes  che  form 


r2(K)  -  K  -  15/128  <  1 

and  cherefore  ic  is  sacisfied.  For  C2 1 ,  on  Che  ocher  hand,  we  have 

_  15  ,  3  15  1  “  ,15.  “  , 

12  8  64  (128}  128  (128J  \  7\ 

n*2 

-  0.1171875  +  0.0054931  +  0.0010728  +  0.0001005  + 

+  0.0000088  +  0.0000006  +  .  .  .  . 

*  0.1238633  (co  che  nearesc  7  figures) 

<  1  , 

which  indicaces  chac  ic  is  aLso  sacisfied,  and  by  a  comforcably  wide 


margin. 
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5.  EXTENSIONS  ID  MORE  GENERAL  TYPES  OF  DECISION  PROBLEMS 

The  general  method  of  derivation  introduced  in  this  paper  has  been 
discussed  within  the  context  of  3-person  stochastic  decision  problems  with 
linear  hierarchy,  but  it  is  equally  applicable  to  similarly  structured 
stochastic  decision  problems  with  more  than  3  decision  makers  and  again 
with  linear  hierarchy.  A  repeated  application  of  the  techniques  utilized  at 
step  3  in  Section  3  (and  also  in  the  Appendix),  by  starting  at  the  bottom 
of  the  hierarchy  and  moving  upwards,  would  lead  to  a  set  of  conditions 
(one  for  each  level  of  hierarchy)  which  involves  multiples  of  infinite 
convergent  series,  under  which  the  hierarchical  equilibrium  solution  would 
uniquely  exist.  Furthermore,  a  natural  counterpart  of  Corollary  3.1  would 
also  hold  for  this  more  general  problem,  in  the  sense  that,  under  jointly 
Gaussian  statistics,  all  decision  makers'  optimal  strategies  will  be  affine 
in  their  static  observations. 

Yet  another  possible  extension  is  to  multi-person  stochastic  decision 
problems  wherein  more  than  one  decision  maker  operates  at  each  level  of 
hierarchy.  Then,  the  single-criterion  stochastic  optimization  adopted 
in  this  paper  at  every  level  of  hierarchy  will  have  to  be  replaced  by 
multi-criteria  stochastic  optimization.  In  particular,  if  the  mode  of 
decision  making  is  noncooperative,  and  the  decision  makers  (at  the  same 
level  of  hierarchy)  adopt  the  Nash  solution  concept,  a  blend  of  the 
techniques  of  this  paper  and  of  Ba^ar  (1978)  can  be  employed  to  obtain  the 
hierarchical  equilibrium  solution  and  the  conditions  under  which  it  exists 
and  is  unique.  If,  however,  the  decision  makers  at  the  same  level  of  hierarchy 
act  as  members  of  a  team  (with  a  single  objective  functional),  then  the 
theory  of  Radner  (1962)  will  have  to  be  used  as  a  supplementary  technique 
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6.  APPENDIX 

In  this  appendix,  we  provide  a  proof  for  Theorem  3.1. 

We  have  already  shown  in  the  discussion  that  precedes  the  statement 
of  the  theorem,  that  the  first  condition  of  (3.19)  is  satisfied  if  and  only 
if  (3.21)  admits  a  solution,  and  that  this  solution  exists  and  is  unique 
if  (3.22)  is  satisfied. 

Taking  this  as  our  starting  point,  we  first  substitute  (3.12)  in 
(3.16a)  to  obtain 

I  -  *  -  +  (D1242  +  G3e2^)]i  +  e  (K e^e23)Q](D21e21  +  g&\) 

+  62^1^21  +  ®1£1G1^  1^-  +  ^  (^2^2^  )  3  C(®4.®2®2  "  F22^ 

[I  +  Z  (K£2tt2 )*]  (D2ie^  +  G +  D*26^  +  G*6*eJ}  , 
n-1 

where  we  have  utilized  the  following  property  of  adjoints  of  conditional 
expectations 

«  62^2  “  e2e2  ‘ 

Now,  multiplying  out  several  terms,  and  collecting  some  common  terms  together, 
we  arrive  at 

I  _  *  “  Al6iei  +  ^D12D21  +  D21D12  "  D2lF22D2l*6l£l 

+  [Di2Gl  '  D21(F22G1  "  G3)iei6l6l  +  *G3D21  +  GiD12  “  GlF22D2lJ £16161 

*  ★  ic  1  L  3  2-3  _  *  -1-2 -3-2 

+  (G3Gl  +  G1G3  -  'iW'lW  +  D21C4D2ieiWl 

+  d*i0 +  <wne>M>2  +  <G4cietEi(Si!i)2 

♦  l  ♦  A3°>ei<*iei>,>+1 

n«l 


1  < 

l,  ' 


i- 


V 


A<n)e}e3(e2s3)n+L] 


where  denotes  the  linear  bounded  operator  inside  the  brackets  on  the 
right-hand  side  of  (3.23a),  and  ^ ,  A3>  A^  are  defined  Likewise  through 
(3.23b)  -  (3.23d). 

For  two  linear  bounded  operators  2  and  <2^  mapping  into  we 
have  [cf.  Kantorovich  and  Akhilov  (1977)] 


rl((7l  +  a2)  *  r1(^j.)  +  r2(a2)  ' 


and  therefore  the  spectral  radius  of  I  -  3  can  be  bounded  from  above  by 

rL(I  -  ■&)  £  ^(AjSjSp  +  r1[(Di2D2l  +  D21D12  “  D21F22D2l)£l6l^ 

+  t1[(D12G1  -  D^1(F212GL  -  G^eje2*3] 

+  rl[  (Gj_D12  -  (gxF22  '  G3^D21^®16161^ 

+  r1[(G;jG1  +  G*G*  -  g*FF2Gl)C^«2C3] 

+  rltD21G4D2iellfii£M]  +  'l^lWX6^1 

+  r^G^G^jC^sJe2)2]  +  r1[G*G4c1e3e?(e2eJ)2] 

+  z  Cr1[Ajn)eJ(e2eJ)ne2]  +  r1[A2(n)eJ(£2e J)n+L] 

n»l 


+  rl[ft<n>e{(e]521),,+1j  +  )  . 

Now,  utilizing  the  line  of  argument  that  led  to  inequality  (i)  in  the  proof 
of  Proposition  3.1,  we  may  readily  factor  out  the  projection  operators 
corresponding  to  conditional  expectations,  and  thus  obtain  the  final  bound 


/•p2p.3vn+li 


r^I  -  3)  S  r 


25 


where  r  is  given  by  (3.24),  and  we  have  used  Che  property  thac 
★ 

r^C^)  ■  r )  for  any  linear  bounded  operator  -•  f^.  Then,  condition 

C2  guarantees,  in  view  of  our  discussion  in  the  proof  of  Proposition  3.1, 
existence  of  a  unique  solution  to  (3.21),  which  can  be  obtained  as  the 
convergent  limit  of  the  series  generated  by  the  iterative  procedure 


;n+L)  -  (I  -  ^)v[n)  +  \  +  Wl2*l2  +  G3*2£2)k3 


(6.1) 


12*  32*  23  -1,  1  2  3 

+  eJ(6‘D2i  +  6^) (I  -  K62£2)  4 [1*2  -  (F22  -  G4eJeJ)k3} 


starting  at  any  initial  choice  e 

To  complete  the  proof  of  the  Theorem,  we  now  show  that  the  second 
Gateaux  variation  (3.18)  is  positive  definite  under  C2^ 

Firstly,  since  r  <  1,  and  £  is  self-adjoint, 


lE[<h1(z1),  (I  -  J)h1(z1)>1}J  <  E(<h1(z1),  h1(z1)>1}  , 

unless  is  the  zero  element  in  Using  this  in  (3.18)  we  have 

^  Lj.  >  E  ).<h  ^  ( z  j_ ) ,  ^^(z^)>^  ”  m  G  » 

2 

provided  that  h^  is  not  the  zero  element.  Hence,  6  is  positive  definite, 
which  completes  the  proof  of  the  theorem. 


4! 

i\ 

*! 

2! 
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THE  GAUSSIAN  TEST  CHANNEL  WITH  AN  INTELLIGENT  JAMMER 
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ABSTRACT 

Consider  the  problem  of  transmitting  a  sequence  of  identically  dis¬ 
tributed  independent  Gaussian  random  variables  through  a  Gaussian  memoriless 
channel  with  a  given  input  power  constraint,  in  the  presence  of  an  intelli¬ 
gent  jammer.  The  jammer  taps  the  channel  and  feeds  back  a  signal,  at  a  given 
energy  level,  for  the  purpose  of  jamming  the  transmitting  sequence.  Under  a 
square-difference  distortion  measure  which  is  sought  to  be  maximized  by  the 
jammer  and  minimized  by  the  transmitter  and  the  receiver,  this  paper  obtains 
the  complete  sec  of  optimal  (saddle-point)  policies.  The  solution  is 
essentially  unique,  and  it  is  structurally  different  in  three  different  regions 
in  the  parameter  space,  which  are  determined  by  the  signal-co-noise  ratios 
and  relative  magnitudes  of  the  noise  variances.  The  best  (maximin)  policy  of 
the  jammer  is  either  to  choose  a  linear  function  of  the  measurement  he 
receives  through  channel- tapping,  or  to  choose,  in  addition  (and  additlvely), 
an  independent  Gaussian  noise  sequence,  depending  on  the  region  where  the 
parameters  lie.  The  optimal  (minimax)  policy  of  the  transmitter  is  to 
amplify  the  input  sequence  to  the  given  power  level  by  a  linear  transformation, 
and  that  of  the  receiver  is  to  use  a  Bayes  estimator. 

•k 

Manuscript  prepared  June  19,  1981. 


I.  INTRODUCTION  AND  PROBLEM  DESCRIPTION 


The  communication  system  depicted  in  Fig.  I  represents  an  extended 
version  of  the  so-called  Gaussian  test  channel  (cf.  [1]),  which  also  includes 
an  intelligent  jammer  who  has  access  to  a  (possibly)  noise-corrupted  version 
of  the  signal  to  be  transmitted  through  a  Gaussian  channel.  More  specific¬ 
ally,  a  Gaussian  random  variable^  of  zero  mean  and  unit  variance  [denoted 

u  -  N (0 , 1) ]  is  to  be  transmitted  through  a  Gaussian  channel  with  input  energy 

2 

constraint  c  ,  and  additive  noise  (w  ■  w^  +  w^)  with  total  noise  variance 
l  Let  the  transmitter  strategy  be  denoted  y(*)>  which  is  an 

element  of  the  space  of  real-valued  Borel  measurable  functions  satisfying 

2  2 

the  power  constraint  Ef[y(u)]  }  £  c  .  The  jammer  has  access  to  a  noise- 
corrupted  version  of 


x  *  y(u)  +  wL  , 


(1) 


Fig.  1.  The  Gaussian  test  channel  with  an  intelligent  jaomer 


> 

This  single  variable  can  be  replaced  with  a  sequence  of  independent 
identically  distributed  Gaussian  random  variables,  without  altering  the 
results  of  this  paper. 
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denoced 


y  ■  x  +  v  , 


(2) 


where  v  -  NCQjJ),  all  random  variables  <Ja ,  w^,  w^  and v)  are  statistically 
independent,  and  a  0,  ^  a  0  and  a  >  0.  Based  on  the  observed  value  of 
y,  the  jamner  feeds  back  a  second-order  random  variable  v  =•  3(y)  Co  the 
channel,  so  that  the  input  to  the  receiver  is  now 

z  ■  x  +  v  +  w?  .  (3) 

The  random  variable  v  is  correlated  with  y,  but  it  is  not  necessarily 
determined  through  a  deterministic  transformation  on  y  [i.e.  3(‘)  is  in 
general  a  random  mapping] ;  furthermore  it  satisfies  the  energy  constraint 

7  7 

E [ v  ]  £  k“.  Let  us  denote  the  class  of  all  associated  probability  measures 
■m  for  the  jammer  by  JT  .  Finally,  the  receiver  applies  a  Borel-measurable 
transformation  S(-)  on  its  input  z,  so  as  to  produce  an  estimate  u  of  u, 
by  minimizing  the  square-difference  distortion  measure 

m 

r  2 

R(Y.  5,  *)  -  -  U]  |  v]  du(v)  .  (4) 


Denote  the  class  of  all  Borel-measurable  mappings  $(’)>  to  be  used  as  an 
estimator  for  u,  by  r„.  Then,  the  transmitter  and  the  receiver  seek  to 


minimize  R  by  a  proper  choice  of  y  •  I*t  and  5  4  r  ,  respectively,  and  the 
jammer  seeks  to  maximize  the  same  quantity  by  his  choice  of  ^  4  .  Since 
there  is  a  complete  conflict  of  interests  in  this  communication  problem, 


an  "optimal"  transmit ter- receiver -jammer  policy  would  be  the  sadd le- point 

it  it  ^  it 

.Solution  (y  4  7^,  5  4  ~r,  „  4  M^)  satisfying  the  set  of  inequalities 


The  maximin  poLicy  „  is  also  known  as  a  least-favorable  probability  measure 
for  the  jammer  [3]. 

♦ 

In  this  paper,  we  verify  existence  and  "essential"  uniqueness  of  the 
saddle-point  solution,  and  determine  the  corresponding  policies  explicitly 
and  in  analytic  form.  The  main  result  is  presented  in  the  next  section  and, 
in  particular,  in  Theorem  1.  The  structure  of  the  solution  is  different  in 
three  different  regions  of  the  parameter  space;  in  one  of  these  regions  the 
solution  is  trivial,  and  in  the  other  two  regions  (which  are  covered  by 
Theorem  1)  the  saddle-point  policy  for  the  transmitter  is  to  amplify  the 
input  signal  to  the  maximum  power  level  through  a  linear  transformation. 

The  saddle-point  policy  for  the  jansner  is  to  choose  a  Gaussian  random  variable 
(or  a  sequence  of  independent  identically  distributed  Gaussian  random  vari¬ 
ables,  if  the  input  is  also  a  sequence)  which  is  correlated  with  the  input 
signal;  the  nature  of  this  correlation  turns  out  to  be  different  in  the  two 
regions  of  interest.  For  the  receiver,  the  optimal  policy  is  to  use  a  Bayes 
estimator.  The  proof  of  this  result,  which  is  given  in  section  II,  is  rather 
involved,  and  at  places  it  requires  some  rather  intricate  arguments,  but  it 
i3  essentially  a  proof  of  Che  "verification"  type. 

Section  III  of  the  paper  includes  some  discussion  on  special  cases 
and  on  some  related  results  in  the  literature.  The  Appendix  provides  proofs 
of  two  Lemmas  which  are  utilized  in  the  derivation  in  section  II. 


This  term  will  be  made  clear  in  the  next  section. 
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II.  DERIVATION  OF  THE  SADDLE- POINT  SOLUTION 

In  this  sec  cion,  we  obtain  the  saddle-point  solution  of  Che  problem 
formulated  in  Section  I,  for  ail  values  of  the  parameters,  c  a  0,  k  a  0, 

^  1  0,  •,  a  0,  j  a  0.  There  exists,  however,  a  region  in  this  parameter 
space,  in  which  che  problem  is  trivial,  in  the  sense  that  the  jamner  has  the 
power  to  do  the  best  he  can  possibly  do,  by  cancelling  out  the  signal  com¬ 
ponent  y(x)  in  the  received  signal  z.  Specifically,  consider  the  region 

Rl-  k2  a  c2  +  +  a  , 

where  the  deterministic  feedback  policy 

3*(y)  *  -y  (6) 

is  feasible  for  the  jammer,  which  leads  Co 

z  *  w2  -  v 

and  thereby  to 

5*(z)  -  0  ,  .  (7) 

resulting  in  a  maximum  distortion  level  of 

R(Y.  4*.  S*)  *  1  • 

Note  chat  the  choice  of  any  specific  coding  strategy  is  irrelevant  here, 

3ince  they  all  lead  to  the  same  maximum  distortion  level,  under  (6)  and  (7). 

*  * 

Hence,  for  this  special  case,  the  pair  (5  ,  j  )  as  given  by  (6)  -  (7)  con¬ 
stitutes  a  (trivial)  saddle-point  solution  (and  the  only  one)  for  any 
choice  of  y  5  . 


Leaving  this  "uninteresting"  case  aside,  we  henceforth  restrict  our 
analysis  to  the  parameter  region 

2  2 

R2.  k  <  c  +  +  a  , 


which  we  further  decompose  into  two  subregions  characterized  by  the  addi¬ 
tional  constraints 


R3. 


R4. 


2  (c  +  5  )  k 
k  -  -j - “  C  +  >  0 

(c  +  5  +  a)*  1 


,  (c  +U  k 

k  -  — - +  5,  <  0 

(c  +  Cx  +  a)*  2 


The  complete  solution  to  the  problem  is  now  provided  in  Theorem  1  below, 
after  introducing  some  notation  and  terminology. 


Preliminary  notation  for  Theorem  1 


Introduce  the  scalar  parameters  *  and  t  by 


-  k/(c2  +  +  cr)5* 


t  -  1  -  {(k2  +  ?2)2(c2  +  Z1  +  o)/[k2(c2  +  i^)2]}  , 


(8a) 

(8b) 


and  let  n  denote  a  Gaussian  random  variable  with  mean  zero  and  variance 
2 

tk  ,  i.e.. 


n  ^  N(0,  tk  ) 


(9) 


whenever  t  >  0. 
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Theorem  1.  In  region  R2,  the  communication  problem  admits  two  saddle-point 

*  *  *  *  *  * 
solutions  (Y  ,  3  ,  u  )  and  (-y  ,  -o  ,  u  ) ,  where 


(i)  Y  (u)  •  cu. 


(10) 


(ii)  u  is  the  Gaussian  probability  measure  associated  with  the  random 
variable 

r^y  ,  in  R3 


v  -  3  (y) 


(11) 


(  X(l-t)”y  +  n  ,  in  R4 


where  t  e  [  0,1]  in  R4,  and  n  'v  N(0,  tk  ) . 


(iii)  5  is  the  Bayes  estimator  for  u  under  the  least  favorable 


distribution  u  ,  computed  as 


I 

I 

I 

I 

I 


1 


* 

5  (z) 


f (c(l+X)/[  (l+X)2(c2+£^)  +  X2a  +  »  1°  R3 

(  [  c/(c2  +  ?.)1:  ,  in  R4 

\  1  —  • 


(12) 


Proof .  The  proof  proceeds  in  two  steps.  We  first  establish  validity  of 
the  right-hand  side  (RHS)  inequality  of  (5)  when  u  is  determined  by  (11), 
and  then  prove  the  left-hand  side  (LHS)  inequality  of  (5)  when  y  and  6 
are  given  by  (10)  and  (12),  respectively.  Finally  we  discuss  the 
"essential  uniqueness"  property  of  the  saddle-point  solution. 

(a)  THE  RHS  INEQUALITY 

★ 

Region  R2  r 1  R3:  Suppose  that  y  is  determined  by  (11)  and  the 
parameter  values  lie  in  region  _R2  ^  R3.  Then,  the  RHS  inequality  of  (5) 
dictates  a  combined  coding-decoding  problem,  with  the  channel  output 
[equivalently,  receiver  input]  being  [from  (3)] 

z  •  (1  +  X)y(u)  +  (1  +  X)w^  +  Vv  +  w^  , 
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where  0  <  (1  +  X)  <  1  from  (8a)  ,  since  we  are  in  region  R2 .  Let 

y(u)  ■  (1  +  \)>(u).  Then,  the  problem  we  face  is  the  Gaussian  test  channel 


(1  +  X)w^  +  Xv  + 


with  square-difference  distortion,  Gaussian  channel  noise  [  with  mean  zero 

2  2 

and  variance  (1  +  X)  +  X  o  +  »  atlt*  channel-input  energy  constraint 

E{[  y(u)]  2  ]  <  (1  +  X)2c2  . 

It  is  well  known  that  this  problem  admits  a  linear  solution  [  cf .  [1]], 
which  is  that  the  best  coding  scheme  is  to  amplify  the  input  u  to  the  maximum 
available  power  level,  i.e. 

Y  (u)  -  c(l  +  X)u  ,  c  >  0  , 

and  to  choose  the  quadratic  distortion  minimizer  5  as  the  Bayes  estimator 
5*(z)  -  Elujz]  -  {(1  +  \)c/[  (1  +  X)2(c2  +  i.)  +  \2a  +  5,J  }z 

J.  *m 

which  is  precisely  (12)  in  the  region  _R2  n  R3.  Moreover,  since  1  +  X  >  0, 

u  1  * 

Y  (u)  -  ■  --  -  *?  (u)  *  cu  , 

which  is  the  same  as  (10).  Hence,  we  have  established  the  validity  of  the 
RHS  of  (5),  for  the  solution  presented  in  Theorem  1,  in  the  region  R2  ^  R3. 
Note  that,  yet  another  possible  coding  policy  for  the  transmitter  would  be 

* 

Y  (u)  *  -cu  ,  c  >  0  , 


m 


i.e.  amplification  by  a  negative  factor,  but  since  this  leads  to  the  same 
minimum  distortion  level,  ve  adopt  the  convention  of  choosing  only  the 
positive  amplification  factor  and  call  such  a  solution  "essentially  unique" . 

Region  R2  ^  R4:  Now  suppose  that  the  parameter  values  lie  in  the 
region  R2  R4.  A  similar  reasoning  as  above  again  leads  to  a  Gaussian 
test  channel 


with  the  channel-input  energy  constraint  being 

E  (t7(u)]2}  <  c2!  1  +  X(1  -  t)V  , 

—  A  % 

where  y(u)  »  [  1  +  X(1  -  t)  ]y(u).  The  Gaussian  channel  noise  has  its  mean 
zero  and  variance 


var  (w)  -  [1  +  A(1  -  O^]  25  +  A2 (1  -  t) 


a  +  tk  + 


Substituting  for  X  and  t  from  (8a)  and  (8b)  ,  respectively,  we  can  evaluate 
the  latter  expression  to  be 


var  (w)  »  S,  -  -  k  - 


(k2  +  s2)2c2  +  2(k~  *  Vc 


(c2  +  ZJ2 


(c  +  5l) 


Likewise,  the  input  power  constraint  can  be  written  as 

var  (7(u))  £  c2[c2  +  ^  -  $2  -  k2]2/(c2  +  l^2  A  a 

and  furthermore 


LO 


m  +  var  (w) 


Again  using  the  well-known  result  for  Gaussian  test  channels,  we  obtain  the 
essentially  unique  solutiorrto  be 


«*<*) 


V  (u) 


E(u]zJ  - 


-  c[l  +  \(1  -  t)1/2]u 

E[uz]  x  m  m111 

var  z  m  +  var(w) 


c 


2 

c  + 


n 


z 


with  the  latter  expression  verifying  (12)  in  region  RA.  Furthermore,  since 

1/2 

1  +  X(1  -  t)  is  nonsingular  (in  fact,  it  lies  in  (0,1]  under  the  assertion 
that  c  s  [0,1]), 

Y*(u)  -  ^ (cl ) / f  1  +  A(1  -  t)1/2J 

*  cu 


which  verifies  (10),  also  in  the  region  R4.  This  then  completes  the  proof 

of  parts  (i)  and  (iii)  of  the  Theorem,  under  the  assertions  that  a  least 

★ 

favorable  distribution  y,  for  v  exists,  as  given  by  (11),  and  t  e  [0,1]  in 
R4.  The  former  assertion  is  verified  next,  in  the  sequel,  and  the  latter 
one  is  verified  in  Lemma  2  in  the  Appendix. 

(b)  THE  LHS  INEQUALITY 

*  * 

Region  R2  D  R3:  Suppose  now  that  y  and  6  are  given  by  (10)  and  (12), 
respectively,  and  the  parameter  values  lie  In  R2  H  R3.  Then,  the  LHS 
inequality  of  (5)  dictates  the  following  optimization  problem  for  the  jammer: 

/•*  2 

max  J  d»*(v)  ’  (U> 

•OO 


where 
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Z  ■  CU  +  +■  V  +■  w,  , 

a  -  c(l  +  \)/[<l  +  1)2(C2  +  zj  +  \Z7  +  i2]  .  (14) 

Noce  chat  (13)  can  also  be  written  as 

J°  .  OB 

maxj  E  [Erjaz  -  u]  |v,y}]  d^(v)  -  mx!/e{sV  -  2azu  |v,y  Jd^v  |y)}+  1 

..SM,  *• 

where 

y  ■  cu  +  +  v. 


Furthermore,  since  is  Independent  of  u,  w^  and  v,  u  ls  independent  of 
w^ *  an<*  they  both  have  zero  mean,  the  latter  expression  can  be  simplified 
to 

E(Vv2+  «22v(cu  +  w  )  +  ar2(c2u2  +  w2  +  w2)  -  2acu2 

-  2arvu|  v,y}  dy,(v|y) }  +  1 
_00 

*  nax  M  )  I*2'*  ’  2vrr(y)J  du(v|y)  }  +  (oc  -  l)2  +  a2(»  +  *  ) 

j,  •<»  * 

where 

-r(y)  -  o£[u|y]  -  »2cE[u|y]  -  a^fwjy] 

•  £[ar<l  -  arc)c/(c2  +  ^  +  a)]  -  [a2^/^2  +  *L  +  <?)]  }y 

2  2 

*  t-*  +  «r(c  +  »)/(c  +  +  a)]y  A  py  .  (15) 

Hence,  we  may  confine  attention  to  the  maximization  problem 


J 


“X  Ev*  J  <*u.Cv|y> }  , 

j,  <M  j  3  ** 


(16a) 


L2 


which  is  in  face  invariant  under  the  transformation  c  -  -c,  and  is  therefore 

also  the  maximization  problem  (for  the  jammer)  corresponding  to  the  pair 
★  * 

(-y  ,  -5  ).  Now,  by  utilizing  the  Cauchy-Schwartz  inequality  [cf.  [2]J, 
(16a)  can  be  bounded  from  above  by 


J  £  max 


1/2 

t*  l  1  f  2  2  1/2 

i  J  ar“v  dy,(v)  +  2  J  V  d^(v)  |E  f  |rr<y)  |  }|  } 

—  CD  L  * 


and  since  „  e  M . ,  this  can  further  be  bounded  from  above  by 


J  £  orV  +  21cjEyf  I-r(y)  |2  )i 1/2 


(16b) 


But,  provided  that 


Eyah(y)|2!}  *  o  , 


this  upper  bound  is  attained  uniquely  if  we  choose,  in  (16a),  ^(v|y)  to  be 
the  one-point  conditional  probability  measure  corresponding  to  the  strategy 

v  -  3*(y)  -  -(k/[Ey{|rr(y)|2}]1/2]  n(y)  ,  (17) 

which  may  be  verified  by  direct  substitution  of  (17)  into  (16a)  and  by 
comparing  the  resulting  expression  with  the  upper  bound  (16b).  Now,  what 
remains  to  be  shown  is  that  (17)  is  equivalent  to  (11)  in  region  R2  '  R3, 
and  that  Ey  £  J  -T(y ) |  }  >  0.  Lemma  1  in  the  Appendix  proves  that  the  coefficient 
p  of  y  in  (15)  is  in  fact  positive  in  the  region  R2  "  R3,  and  hence  the 
latter  requirement  is  readily  fulfilled.  Furthermore,  since 


[EyCMy)l  ]] 


•r(y)  • 


Ipj |var  (y) 


Xy  , 


from  (9a)  and  the  property  that  p  >  0,  the  former  requirement  is  also 
satisfied.  This  then  completes  the  verification  of  Che  LHS  inequality  of 
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i ;5> ,  and  thereby  verification  of  the  theorem,  for  the  region  R2  '  R3. 


Region  R2  '  R4:  We  now  finally  verify  the  LHS  inequality  of  (5)  when 
the  parameters  belong  to  the  region  R2  ~  R4.  What  replaces  the  maximization 
problem  (13)  in  this  case  is 


max  f  E^[— ^ - 

C  *  n 


Z  -  U]  |  v!  dy,(v)  , 


which  can  be  rewritten  as  (through  some  straightforward  manipulations) 


+  S,)‘ 


max 


x>  •  »  1 

V  /  IV2  -  2vE[^u-  w  |yj}  du(v|y)J  +  ;  +  +  -j  i 

J  ■<&  *■  r*  1  * 


(18) 


which  is  an  expression  that  is  invariant  under  the  transformation  c  -  -c. 
Now,  note  that 


£(T 


U  -  w  ly]  -  — 


y  - 


» 

*1 


+  5,  +  o 


r 

n 


and  therefore  the  maximizing  solution  is  any  probability  measure  with 
the  property 

-t  2 

J  v“  ^(v)  “  k 

-CO 

* 

Let  us  now  investigate  whether  ^  .determined  by  (11), is  one  such  measure 
in  region  R2  *  R4.  Towards  this  end,  it  suffices  to  show  that 

1/2  "> 

var[X ( 1  -  t)  y  +  -]  -  k" 

and 


I 


* 


C  «  [0,1] 
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The  latter  la  shown  in  Lemma  2,  in  the  Appendix.  For  the  former,  simply 
note  that,  because  "  ~  N(0,  tk“)  and  independent, 

var  [X(l  -  t)1/2y  +  1]  -  X2(l  -  t)  var  (y)  +  var  (') 

2  ?  ? 

*  X  (1  -  t)(c  +  5l  +  ff)  +  tk~ 

-  k2a  -  t)  +  tk2  -  k2  , 

thus  establishing  the  desired  result.  As  a  parenthetical  remark,  we  should 
mention  that  the  estimator  (12)  in  region  R2  P  R4  may  also  be  viewed  as  an 
equalizer  decision  rule  (see  [3]]  since  the  conditional  (on  j,)  risk  function 

corresponding  to  it  is  a  constant  on  ,  the  boundary  of  .  [Note  that 

in  this  interpretation,  elements  of  ^  are  the  decision  variables  of  the 

jaumier,  and  we  have  to  introduce  probability  measures  on  3M..J  Hence,  the 

★ 

minimax  (saddle-point)  property  of  5  in  R2  P  R4  can  also  be  verified  (with 

★  * 

Y  fixed,  as  given]  by  resorting  to  a  well-known  property  of  equalizer  decision 

rules  when  they  are  also  Bayes  with  respect  to  a  least  favorable  probability 

measure  [which  in  this  case  is  the  one-point  distribution  on  3M. ,  which 

1/2 

selects  the  Gaussian  random  variable  \(1  -  t)  y  +  *1] ;  see,  [4],  [5].  But, 
the  proof  given  here  seems  to  be  more  suited  to  the  problem  under  considera¬ 
tion  since  (1)  it  does  not  require  additional  probability  measures  to  be 

-  * 

defined  on  ,  and  (ii)  it  also  establishes  the  optimality  of  y  . 

Xo  recapitulate,  we  have  verified  existence  of  a  saddle-point  solution 

(10)  -  (12)  for  the  communication  problem  under  consideration,  in  the 

parameter  region  R2 .  The  analysis  also  readily  leads  to  the  conclusion 

★  ★  * 

that  in  addition  to  (10)  -  (12),  the  triple  (-y  ,  -4  ,  *  )  also  provides  a 
saddle-point  solution,  naturally  leading  to  the  same  saddle-point  value 


> 
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for  R.  The  question  now  arises  as  to  whether  other  saddle-point  equilibria 

exist.  In  region  R2  '  R3,  there  is  clearly  no  other  saddle  point,  since 

★  ★ 

the  maximization  problem  (16a)  [which  corresponds  to  both  (y  ,  6  )  and 
*  * 

(-y  ,  -5  )]  admits  a  unique  solution,  thereby  eliminating  the  possibility 

o£  multiple  saddle-point  policies  for  the  jammer . [otherwise,  interchange- 

ability  property  of  saddle  points  (c£.  [6])  would  lead  to  a  contradiction]. 

In  the  remaining  part  of  R 2,  i.e.  R2  C.  RA,  however,  the  issue  is  more  subtle. 

Since  the  maximization  problem  (IS)  is  invariant  under  different  choices 

of  probability  measures  from  ,  the  LHS  inequality  of  (5)  clearly  does 

not  admit  a  unique  solution  -  in  fact,  all  second-order  probability  measures 

2 

with  first  moment  zero  and  second  moment  equal  to  k  constitute  a  solution. 

But,  for  any  one  of  these  to  constitute  a  saddle-point  policy  for  the  jammer, 

it  has  to  be  in  equilibrium  with  (y  ,  8  ) ,  because  of  the  interchangeability 

* 

property  of  saddle-point  equilibria.  This  further  implies  that,  with  y 
* 

fixed  as  given,  5  has  to  be  Bayes  with  respect  to  that  least- favorable 

★ 


distribution.  Since  $  is  a  linear  estimator  and  all  random  variables  are 
Gaussian,  this  requires  the  chosen  element  of  to  be  a  Gaussian  probability 
measure,  and  some  further  analysis  reveals  that  (11)  is  in  fact  the  only 
such  element.  n 


f 


i 


i 


Some  of  the  expressions  derived  in  the  proof  of  Theorem  1  now  lead  to 
the  following  Corollary  which  gives  the  saddle-point  values  in  different 
regions . 

★ 

Corollary  1.  The  saddle-point  value  (R  )  of  R(y,  6,  u)  In  different 
regions  is  given  as  follows: 

★ 

R  -  1 


Rl: 


V 
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III.  DISCUSSION  OF  SOME  SPECIAL  CASES ,  AND  CONCLUDING  REMARKS 

The  general  solution  to  the  communication  problem  of  Fig.  1  has  the 

property  that  It  is  structurally  different  in  the  two  regions  of  interest, 

with  the  dividing  ’’line"  between  these  two  regions  being  a  hyperplane 

determined  by  the  allowable  power  levels  for  the  transmitter  and  jammer,  and 

the  noise  intensities  in  the  main  channel  and  the  jammer’s  wiretap  link.  In 

particular,  if  the  transmitter's  allowable  power  level  (c  )  is  larger  than 

2 

that  of  the  jammer  (k“),  we  stay  in  region  R 2,  and  if  this  difference 
is  sufficiently  large  the  jammer's  maximin  policy  is  an  additive  mixture  of 
a  linear  transformation  on  his  measurement  and  an  independent  Gaussian  random 
variable,  whereas  if  the  difference  is  small  it  is  more  likely  (depending 
on  the  values  of  other  parameters)  that  his  maximin  policy  will  be  only  a 
linear  transformation  on  his  measurement. 

If  the  wiretapping  channel  noise  variance  (o)  is  sufficiently  large, 
the  parameter  region  is  R2  u  R3,  and  hence  the  optimum  strategy  for  the  jammer 
is  a  linear  policy  -  which  may  seem,  at  first  sight,  to  be  somewhat  counter¬ 
intuitive,  since  the  information  contained  in  y  (concerning  u)  is  quite 
unreliable.  However,  some  scrutiny  reveals  that  the  jammer,  in  fact,  uses 
this  noisy  measurement  as  a  source  of  noise  in  order  to  jam  the  transmission 

channel.  This  makes  particular  sense  in  the  limiting  case  j  -*  »,  when  the 

* 

optimal  jamming  policy  is  to  choose  u  as  a  Caussian  distribution  with  mean 
2 

zero  and  variance  k“,  which  should  be  independent  of  the  transmitter  output. 
This  conclusion  for  the  limiting  case  corroborates  a  result  obtained  in  [  7  ] 
in  a  somewhat  different  context.  More  specifically,  this  recent  reference 
addresses  the  problem  of  obtaining  optimal  policies  in  the  presence  of 
jamming,  when  j  aimer's  policies  (considered  as  random  variables)  are  forced 
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to  be  independent  of  the  transmitter  outputs,  and  the  loss  function  (to  be 
min maximized)  is  taken  as  the  mutual  information  between  the  transmitter 
output  and  the  receiver  input.  In  this  framework,  McEliece  and  Stark  solve 
in  [  7  ],  as  an  application  of  their  general  approach,  the  communication 
problem  depicted  in  Fig.  1,  but  without  the  tapping  channel,  and  arrive  at 
Che  conclusion  that  Che  least-favorable  distribution  for  v  is  a  Gaussian 
distribution.  Hence,  the  two  seemingly  different  problems  (with  different 
loss  functions  —  square-difference  distortion  and  mutual  information) 
admit  the  same  saddle-point  solution  in  the  presence  of  an  independent 
jammer  strategy. (This  equivalence  can  in  fact  be  verified  directly  by  making 
use  of  3ome  inequalities  of  Shannon  [  8  ]  on  mutual  information. I  3ut,  this 
equivalence  does  not  directly  extend  to  the  communication  system  considered 
in  this  paper,  and  derivation  of  the  saddle-point  solution  of  the  communi¬ 
cation  system  of  Fig.  1  when  the  loss  function  is  taken  as  the  mutual  infor¬ 
mation  between  u  and  z  remains  today  as  a  challenging  problem. 

There  exist  quite  a  few  results  in  the  literature  on  worst  case  designs, 
wherein  the  Gaussian  distribution  has  been  proven  to  be  the  least-favorable 
distribution  (such  as  the  cases  of  entropy  maximization  [  9],  Fisher-information 
minimization  [  10],  or  minimax  estimation  problems  [11],  [  12]),  and  the  present 
paper  adds  to  this  li3t  a  new  class  of  problems  not  considered  heretofore. 

T.Je  should  note,  however,  that  if  the  input  sequence  in  Fig.  1  is  vector¬ 
valued  and/or  the  number  of  channels  is  more  than  one,  the  saddle-point 
solution  will  no  longer  be  linear-Gaussian  (i.e.,  the  solution  of  this  paper 
does  not  carry  over  to  the  vector  case)  ,  since  the  counterpart  of  the  Gaussian 
test  channel  does  not  admit  a  simple  linear  coding  scheme  in  the  vector  case 
[131. 


IV.  APPENDIX 


In  this  appendix,  we  provide  proofs  for  the  two  Lemmas  which  were  used 
in  the  proof  of  Theorem  I  in  section  II. 

Lemma  1 . 

2  o 

p  *  -a  +  (a(c  +  aa)/(c“  +  5^  +  o)]  >0,  in  R2  n  R3 , 
where  or  is  defined  by  (14). 


Lew™ -=»  2 . 

t  c  [0,1]  in  R2  ^  R4  ,  where  t  is  defined  by  (8b). 
Proof.  Starting  with  the  inequality  that  determines  R4 , 
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2  (C  *  V 

k2  +  5  2  1  -5 - h:  k 

2  (c  +  5L  +  «)* 


(k2  +  52)(c2  +  +  o)1  <_  (c2  +  5X)  k  , 


and  squaring  both  sides 

(k2  +  52)2(c2  +  5X  +  a)  _<  (c2  +  SL)2k2 


we  arrive  at 


(k2  +  C2)2(c2  +  +  o) 

(c  +  5X) V 


<  1 


Since  this  latter  expression  is  equal  to  1  -  t  [from  (8b)] ,  and  it  is  also 
positive,  the  desired  property  follows.  = 
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WITH  -NCERTAIN  RATE  FUNCTIONS* 

Evj^eloa  A.  leraniocis  ind  H.  Vincent  Poor 
Department  it  Electrical  Engineering  and  cne 
looccmatea  Science  Laboratory 
Tm varsity  >f  Illinois  ic  Jrbana-Champai gn 
.roana,  Illinois  oUOi 


Abstract 

This  paper  crises  the  following  decision 
proolema  for  continuous- cine  ays  cams  with  iiscon- 
tinuous  observations  <i.a.,  for  systems  observed 
mrougn  point  procassaa  >  - 

I.  Robust  aaccned  filctring. 

II.  Robust  Wiener  filtering 

III.  Mimmax  suea  estimation  for  sviteas  with 
noisa  uncertainty . 

In  e sen  case  there  is  issusiad  co  oe  sons  legree  it 
ir.cartamty  in  the  race  function  of  in  observed 
Pcisson  process,  ino  i  corresponding  -Unimex  :esign 
pnilosoonv  is  laopceo.  In  Problem  I  we  assume  thee 
-ha  race  or  me  oosarvacion  process  is  a  iecerain- 
i s lie  function  of  line,  and  m  Prooiema  II  and  III 
we  lssuma  mac  mesa  races  ire  vide-sense-s tacion- 
iry  stochastic  processes.  General  solutions  to  the 
three  oroolems  ire  considered  in  terms  of  least* 
favoraola  rata  functions  or  processes,  ind  several 
useful  -aoceis  of  incertamtv  ire  clscussed  in  tns 
context . 

Introduction 

The  purpose  of  this  paper  is  to  formulae*  and 
lol/a  three  minimax  lecision  probLems  for  concinu- 
a us  -time  systems  ooserveo  through  point  processes. 
In  particular,  for  Poisson  ooservatlons  with 
-ncertain  races,  we  consider  the  proolema  of  robusc 
natened  filtering,  roousc  Wiener  filtering,  and 
unimax  state  estimation.  All  of  these  problem# 
nave  oeen  considered  in  the  context  of  continuous 
observations.  In  particular,  for  continuous  obser¬ 
vation  models,  roousc  mscched  filesring  has  oeen 
considered  in  '1-4],  robust  Wiener  filtering  in 
.5-4!,  end  mniaax  state  estimation  in  [9-10!. 

Here,  we  apply  the  methodology  of  these  earlier 
works  to  consider  these  problems  for  discontinuous 
jo serve cions . 

In  Section  I,  the  robust  matched  filtering 
problem  is  considered.  We  see  here  that,  although 
cne  probLea  formulation  and  its  solution  for 

uncertainty  resemble  that  of  the  corresponding 
proDlam  for  continuous  signals  [3],  the  dlscontin- 
jous  prooiem  is  not  directly  transformable  co  the 
continuous  one.  Furthermore  cne  special  signal 
structure  «  the  nonnegativicy  of  the  race  function') 
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u  the  disconcinuous  obccrvccions  :tli  allow*  us 
;o  consider  for  ch*  signal  iraca)  any  uncertainty 
nodal  dascrioed  by  2 -alcarnacing  capacicias  (,*.g. 
concaminacad  mixtures,  total  /arlacton  nelgnbor- 
hoods,  band  models,  and  extended  p-polnc  monels 
< »*•  [22)));  chit  cannot  be  don*  for  ch*  conclnu- 
out  observation*  caaa.  therefore,  collet*  re¬ 
in*  lysis  of  this  problem  is  required  acre.  Sec¬ 
tion*  II  and  III  treat  tn*  ralated  problems  of 
robust  Wiener  filtering  and  mini max  state  estima¬ 
tion,  respectively.  la  contrast  to  the  mateneo 
filtering  problem  of  Section  I,  ic  is  seen  thsc 
these  two  problem*  can  be  trsnsformaa  ulrtcciy  to 
malogou*  continuous- time  problams  'a*  creacad  in 
’ 3  1 , ; 10 ] )  and  thus  are  solvabl*  using  tech¬ 
nique*  of  pravious  studies .  For  tnese  problems 
j*  present  an  approach  thac  treats,  in  a  unified 
v*y,  soma  important  models  for  uncertainty  la  the 
racas  fe.g.,  band  models  and  eoncaaineced 
mixtures  1  . 

I.  ROBCST  MATCHES  FILTERING 
A.  Syscsm  Modal 

Consider  tha  phocodectccor  ispicted  in  Fig.  1 
vnlch  is  usad  for  fiber  ana  free  apace  opclcal 
coeBinlcatlon  systems  isaa  11)1  and  vnlch  allows 
us  to  emphasize  different  noise  contributions  The 
ouc^uc  of  the  phocodectccor  is  given  by  the  sum  of 
a  filtered  Poisson  process  i,(t)  plus  an  indepen¬ 
dent  zero-mean  thermal  noise  l  .  (e).  The  currenc 

in 

i#<  t)  can  be  expressed,  for  c  a  3,  as 

N 

V  M  'C  )  a.l) 

n«l 

where  [S.,n  0]  Is  an  inhomogeneous  counting 

process  such  thac  N  is  ch*  number  of  pnocoelec- 

crons  gtnerscsd  during  [0,0,-  is  the  emission 

n 

tl3*e»  sad  che  g  *s  ere  t.d.  random  variables  that, 
n 

in  avslsnch*  phocodiodss  (APD’s),  model  the  number 
of  secondsry  electron*  gensrsced  for  esch  primary 
phocoelsctron.  Hers  *  is  ch*  charge  of  an  elec¬ 
tron  and  cht  APD  impulse  response  Is  assumed  to  be 
gc  <  c)  where  «  Is  me  Otrac  deles  it.*.,  tn*  photo¬ 
diode  is  assumed  to  be  ideal).  This  is  not  a 
major  rescrlcdon  and  tha  results  thac  follow  ssv 
be  modified  for  the  general  case  In  which  the 
impulse  response  is  gh^'t)  with  h(  nonimpulstv* . 

The  tncenslcy  *. .  of  n  is  related  to  ch* 


I  Vtizi  «->t  X.’.d  .'citf  :i,  id.;  r<.C.Jc,  SA, 

*.'tc  '-..cccs.t.!.ij  .  •  *ic  '.•C'.w-c.tcc. 


t 
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liicldanc  Jpcical  ?ov«r  ov 

■:*  r  ll-*> 

•jn«r«  "  is  ir.«  :uancum  4;flci«ncy  jf  ziit  ana 
>,  ia  in*  energy  m  i  anocon,  viz.,  n  is  Plank's 
:jnsC4AC  ana  ii  :n«  inaoaulacta  ^pcical  carrier 
frequency.  >«  assurat  "hac  p  <  - )  is  -it  car^inis  rlc 

ina  rhus  >0  is  '•  ■  *  itcinta  in  il.I1.  The  race 

j  a 

accounts  far  :ne  constant  race  ac  vhicn  aponcant- 
3us  auc  ixcrantous  e Lee  crons  art  generated  curing 
I ,  c  -  .  As  a  consequence  i.it:-  is  af  recced  ay  a 

aiscurbance  calieu  che  cark  carrenc  vnich  is 
usually  negligible  provtaed  ?(Z)  is  .noc  coo  small. 

The  filtered  Poisson  process  ac  the  output  of 
the  receiver  filter  see  Fig.  i)  j w  can  be  written 


'  n-l  n 


i  r  >h^c  --  *4  •  . 


In  12,  ?p.  108-1791  the  characteristic  function, 
cne  cuasilancs,  and  thus  the  moments  of  y  were 

evaluated.  In  particular  the  a ignal* to-aoise  ratio 
at  the  output  of  the  receiver  is  given  by 

-  ”  y .  .  •  ^ .  i  _r  *'*•01  _  2 


n «  t  -  -  • '  >  -  ;  dr 


■»"  t  -  *  ’*  ur-  -  ’  ■  ♦  *  -  ]  h“<  :•*  >4- 

■5  5  2e"sr*')  J'o 

a..» 

-ne r«  2  i*  che  cvo-siaed  speccral  iensicy  of  che 
cr.« rail  noise  proem  wnicn  it  niuua  co  o«  .nice 

Russian,  iru  the  -nonanes  E(g)  and  £ig"">  of  che 
gain  at  cne  APS  vara  evaluated  in  131;  for  our 
prooian  cnese  ara  given  constants. 

The  assumption  in  (l..)  spout  cha  channel  noisa 
being  a  white  Russian  procaas  is  not  rastrictiva 
unci  this  is  a  raalistic  enough  nodal  for  our 
syscem;  notice  chat  in  a  photon  cnannal  cha  main 
source  of  ’noise"  is  cha  shot  noise  involved  in  cha 
counting  process  N  .  Therefore,  in  what  follows  wa 

will  Pa  primarily  interested  in  cha  "signal" 
process  ■  (”  ) . 

1.  The  Matched  Fllcer 

.is  proved  in  (12,  pp.  168-1721,  aa  certain 
parameters  tend  co  prascrlPad  limits,  cha  process 
yiCi  rdaflnad  in  (1,3))  tends  to  a  Gaussian  process 
on  3,t)  vith  naan  the  unsquared  numerator  of  (1.4) 
and  variance  cha  denominator  of  (1.4).  Tharafora 
the  probability  of  error  reduces  Co  3  (SNR)  (where 
J  is  cha  call  of  cne  standard  Gaussian  discribu- 
ciani  and  5NR  becomes  a  useful  performance  measure. 
Thus,  maximizing  cha  SNR  over  all  possible  fllcer 
impulse  rasponaaa  Is  desirable.  The  resulting 
optimal  filter  is  cha  matched  fllcer  - 

Lee  s  stand  for  ■•(*>,  h  for  hit-'),  and 


n  *  ,  ,  +■  *  .  Let  •  ba  cha  i.nnar  prod- 

W  ,  -  fc.  d 

3a  c(g  ) 

uct  defined  for  real  a,b  as  '  a,b  •  a(-)b(-)dT. 

"  3 

Then  n  r  Z,  s  i  Z  »  [a:*  z  Z,  a  z  )}  where  Z  is 
L"(0,e)  and  SNR  of  (1.4)  may  be  written  as 
SNR  -  Z~  < i)c  (h;s,n M>/E(g")  wnare 


c (h ;s ,n) 


'  h,  (s  -  nth . 


cor  any  n  z  S,  cha  apace  of  bounded,  linear,  posi- 
cive  operators  mapping  Z  co  icself.  Tor  cha  con- 
clnuoua-tisa  casa  chat  we  consider  in  this  paper, 
n  is  Che  autocorrelation  function  of  che  zero- 
mean  Gaussian  thermal  noise  process  and  it  is 
generally  of  che  form  )<e  (»  ,J  )*>..:  (*  -•?).  In  this 

general  case  s  *  n  of  (1.5)  ruse  ba  Interpreted  as 
s(*H(»- G)*n(-,3).  If  in  particular  n(T,d)  ia 
of  che  form  n(r)6(-  -?)  (l.a.  che  noise  process  Is 
timewise  uncorre laced)  chan  s-n  of  (1.5)  is  just 
s(*  i  *■  n(- ) ;  the  white  noise  casa  chat  we  creaced 
first,  above,  is  a  special  case  of  this  for 

n(-  )  •  n  for  ell  -  . 
w 

The  matched  fllcer  problem  for  fixed  »  and  n 
is  given  by  (compere  with  r31) 

max  p(h;s,n)  (1.6) 

h-I 

The  solution  co  chis  problem  for  fixed  a  is  given 

by 

Property  l  CMacched  Filter'': 


max  :  (h;s,a)  •  :  ( (s  -  ni  s  ;s  ,n>  •  1  s ,  (s  *  m  a 

(1.7) 

- 1  - 
•where  (sm)  is  che  inverse  mapping  in  Z  corre¬ 
sponding  co  s  -  n  (recall  s  a  3  and  i  U  i  posicive 
operator) . 

Proof .  Follows  straightforwardly  from  cha 
Schwarz  inequality. 

We  are  going  co  need  the  fallowing  two 
properties  of  chs  functional  s  . 

Property  2:  For  fixed  h  z  Z,  :(h;s,a)  is  convex  in 
(s,n)  €  Zr  «  Z. 

Proof .  The  proof  is  similar  co  chat  of  property  2 

of  (31. 

Property  3 :  The  functioned 

max  c  (h ;s ,n)  -  (  s,  (s+n)*l-s  is  convex  in  (s,n) 
h  z  Z 

on  Z*  <  2. 

Proof.  Since  (  s  ,  (s-hi)**s>  •  j((shi)"^;s,n) 
Property  3  followa  from  Property  2. 

Remerlc_l.  In  compering  '  s,  (s*n)*ls)  of  (1.7)  with 

's.n'^s)  of  [3]  nocice  Chet  che  discontinuous 
observations  casa  Is  equivelant  co  che  continuous 
observations  cate  with  autocorrelation  function 
n(r,o)  *  s(')d  (t  -it)  (i.e.  che  useful  signal  also 
plays  cha  rola  of  additive  uncorrelated  noise). 

takl.  Let  us  assume  chat  V.  *0  (no  "dark" 


current  is  present)  and  thee  no  thermal  noise 
disturbs  che  system  of  Fig.  1.  Then  via  (1.4), 
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•'1.33  reduces  Co 

: (h;s, J3  •  h,s  "  'h.sh 
and  1.73  co 

max  d(.h;s,J>»:<l.s,J3*  s,l 
~l  r  Z 

:hac  is  eh*  opcimal  filtar  is  ehac  one  wi.cn  impulse 
response  identically  1;  m  ocher  words  che  naecned 
filtar  is  a  pure  integrator  In  chia  case. 

:.  The  Robust  Maccntd  ~1 3 cer  fee  .'ncertam 
iignal  jcruccure 

equation  (1.7)  inclcaces  chat  cor  cnown  s,n 

che  maccnad  filter  is  given  oy  H  ■  s-ni  *i.  Suppose 
now,  chat  s  and  n  are  only  renown  co  be  members  of 

c.ne  classes  V  -  Z~  and  m  Z  Z.  We  would  like  co 
find  a  filter  wnose  performance  does  noc  deterior¬ 
ate  drasclcally  over  J  .  “.  then  as  in  3!  we  say 
chat  hR  i  Z  is  robus t  over  J  •  m.  if 

inf  :  .hjis.n)  » aax{  inf  :  .h;s,n);  (1.33 
•  i,n )ne'  *  *"  “  hm Z  s ,  n  c y 

We  note  cnac  (h.  ; a.  .  n.  i  ~  <  ZT  •  Z  is  a  seed le 

point  solution  to  the  gams  of  <1. 33  if 

inf  :  '  \  :  a  ,  n i  *  ;  •  h,  ;  a,  ,  ru  '  -  max  c  ( H ;  s,  .  rv.  i . 
(i.ar:*'*  "  “  “  h  n  X  “  “ 

(1.9) 

Concerning  such  a  solucion  we  have  che  following 
result . 

'-edge  1.  Suppose  J  and  “?  are  convex, 

a.  ,.V  •  t  J  ■  *\  and  h»  *  '.a.  -  n,  i*1-!.  .  then 

,3.,.-,’  is  a  saddle  point  for  eq .  (1.33  Iff  che 
following  inequality  raids  for  all  (a.nl  V 

t'ru.j  -'h_,st  a  'h.,(s-a)h_  (1.103 

Proof.  Per  3  i  [0,1]  and  (s,n)  t  e'  •  "7  define 
-  3 )  ST  -  os , ,  •  f  1  -  j )  n.  -  an  and 

:<(or  ;s  ,ni  ■  d  Ih^’.r  )  . 

then,  since  </  and  “  are  convex  (H,  js^.n^'  1*  1 
saddle  point  for  aq.  (1.3)  iff 

<(3; s,n)  a  K(0;s,n) 

for  all  3  [0,1]  and  a  J .  Since  J  and  v  are 

linear  rune  done  of  3  and  <(cr,s,n)  Is  through  Its 
definition  and  from  Property  2  a  convex  function 
of  ( j , n )  it  follows  thee  K(or;s,a)  Is  convex  in  3 
for  each  (s,n)  J  x  thus  K(a;s,n)  i  ic(0;s,n) 
holds  iff 

:».K(a;s,<t.,*»1|a-0  »  0 

for  ill  <  s  ,a ;  -  J  <  .  On  ilf  ftrtndJClng  w«  h*v« 

'‘.K.di;s,.h),  )ai;^0 . 2' s.h^  *  (s-nHi^) 

and  learns  1  follows. 

Sow,  we  define  a  pair  (s.  ,n,  1  to  be  least 
fsvorao le  for  '  “  if 

a.,(j.-n,  Ls, '  -  din  's,(s-n)  ^s  .  <1.113 

-  "  **  L  (>,«rP.’ 


It  follow*  easily  fromEq*.  (1.73,  (1.9)  end  (1.11) 
that  (s.  ,n,  )*d»' '  Is  lesac  favorable  for*1 
-  -  - 1 

ana  “  if  <ht  ,s,  ,  a^)  where  a.  *  <s,  -a,  3  s^  is  a 

saaqle  poinc  solution  to  <"-.33.  We  also  have 
'  riay  3  Suppose  J  and  “>  are  convex,  (s,  .n,  <*! 

and  nT  •  (  s, -n^)  ” '’s,  .  Furthermore  suppose  that 

s  ,  (p  1  ''s.  Is  right  continuous  in  dr  at  3  ■  0 
for  each  1 s,j)tV  <"!  where  ,  »(l-ar)n,-an  and 
r»<l-a)s,  -as  .  then  (s^ ,  n^)<t  '  7?  is  least 

favorable  for  J  and  “  iff  (1.10)  holds  for  all 
(s  ,n)  a  J  < 

Proof.  The  proof  is  similar  to  the  proof  of  Leons 
1  and  followe  step  by  seep  che  proof  of  Lama  2 
and  the  Appendix  of  [3]. 

Lemmas  1  and  2  imply  that,  within  a  mild  con¬ 
tinuity  requiremanc  tha  triple  ih.;s^,a_)  with 

1  a,  ,  n,  )  J  '  “I  and  h,  ■  <s^  -  n,  jlv*»  che 

aesired  solucion  to  our  problem  iff  (1.L0)  is 
satisfied . 

In  the  sequel  we  find  che  robusc  matched 
ciicer  (l  for  classes  J  of  the  signal  dascrlbed 

by  L,  uncertainty  or  uncarraincy  described  by 

Ihoquet  capacities.  Although  we  could  consider 
uncertainty  in  tha  noise  (or  in  the  signal  and  tha 
noise  stnulcanaously) ,  we  rescrlcc  attention  co 
signal  uncertainty  only,  since  tha  thermal  noise 
can  ba  vary  realistically  modeled  as  white 
Oaussian  for  most  applications,  therefor*  it  is 
noc  so  res  eric  cl v*  co  assume  chat  w*  daal  with 
nonnagaclv*  autocorrelation  functions.  Also 
recall  that  the  signal  s  which  represents  a  race 
function  is  always  nonnagaciv*.  Under  chess 
assumptions  chs  continuity  requirement  of  Leona  2 
is  satisfied. 

C.l  L,  uncertainty 

Let  V  *  [n^J  satisfying  the  assumptions  above 
tad  assume  thee  the  signal  quantity  is  known  to  be 


in  the  class  -  Z*  defined  by 


-  f.s  €  sTs-Iq,:2  s  i] 

a. 12) 

where  s.  can  be  choughc  of  as  a  known 

noalnAl 

signal  and  2  >  0  as  a  degree  of  uncertainty  in  che 
noednal  model. 

Define 

*t-  (,L*  !'0)hg  *  *0  *C0hR(2  'hR) 

(1.13a) 

s5 

( 1 . 13b ) 

where  h^  satisfies  the  equation 

nQhR  -  (1 -hR)fsQ -d0h^(2 -i^)]  . 

(1. 13c) 

than  vd  have  chat  if  a  solution  co  •1.13c'1  ir  1 
1 1.13b)  exists 

theorem  1:  the  triple  .dg)  1a  1  saddle  point 


i 


solution  co  eq .  '.1.9'  for  J,  and  [n1  j. 

3  -l 

Proot .  oince  s,  and  t,  •  <  s,  -  n^  ■  s,  it  is 

sufficient  '/id  Lemma  l  co  snow  code  >1.1.01  with 
h,  *  a.  xolds  for  ill  s  -j  J  .  1:  ij  easily  seen 


that,  coca  n  : 


m*  >.  1  -  n,  ;  s .  .  e q  .  1.101 


h_  .  .,2  -  h_  '  1 1  *  i.  ■  a  1 

ana  men  from  ,  1.13a  i  and  .’.lib) 


3uc  from  in a  Schwarz  inequality,  '1.13b)  and  >1.12) 


,'hR'::'S'lS*,0  iS  V2*V  ' 

inc  ch«  theorem  follows. 

Equation  (1.13c),  which  $iv«s  ch«  robusc 
=acch«d  fllstr  for  our  problem,  can  b«  solved 
iceracively  co^echer  with  (I. lib).  Solucions  :o 
re  laced  equations  for  th«  continuous  observations 
case  have  seen  creaced  in 

3-3  .neertainev  within  capacity  classes 


In  this  section  we  consider  uncorrelaced 
thermal  noise  processes  with  autocorrelation  func¬ 
tions  or  the  form  -  r )  where  n^('  >  >  3 

a.e.  and  let  us  iefine 


v ,d* 


the  important  case  or  white  none  la  Included  In  : 

bhe  oldss  loova  *  M  l  and  *  :.'„t.  2  for 

/  J  N  j 

unite  rone  -dj.cn  tvo-sited  spectral  density  2). 

:r.  c.na  -lonnegative  signal  a  iai*i  (" ) )  we  tmpoae 
me  constraint 


icuaelon  1.151  ia  dn  average  power  constraint  "w 
ce-.oeas  average  power,  lenotea  average  energy)? 

:.'oce  :.m:  in  nose  opcical  communications  aysceme 
cne  race  function  ia  proportional  to  the  trana- 
dtted  power  '’sea  for  example  *q.  11.21). 

3efore  characterizing  the  uncertainty  claaa 
for  the  signal  we  need  soma  definition*.  Let  .3 
denote  and  7  denote  the  3or«l  r-algebra  on.*. 

Thar.  1  finite  aet  function  v  on  2”  ia  a  2-eltsrnat- 
lag  capacity  Id;  on  C.  J'  if  it  ia  increasing, 
tontinuoua  from  below,  continuoua  from  above  for 
closed  sets,  and  if  it  aadafles  yfj)*0  end 
/(A  .  3)  -  v(A  *3iS  v(A  >  -  vfB)  for  ell  A, 3  7. 

Obviously  any  finite  measure  ie  e  2-elternetlng 
cepeeity. 

For  eny  pair  (v^.y^i  of  2-elternetlng 

capacities  on  C,j7)  there  exists  e  Radon-Nlkodyn 
ierivetive  ov^'dv-j,  Introduced  In  [19],  with  the 

leftnlng  property  chat  far  each  x  0  '  0 ) 


. dv,  iv,  >  XI  - 
k  J 


d  A!  »  ll-X. 


1  •  Inf  d  (A)  where 

it-  x 

A  w 

1;xv3(ai  -  vj^.a0) ] . 


Cefina  the  finite  measures  n,  ind  a  so  thee 

(  i 

they  are  absolutely  tontinuoua  with  respect  to  the 
Lebesque  measure  ..  on.*,  with  derivatives  n^  and  a, 

respectively.  This  restriction  ia  only  for  the 
purpose  of  preserving  notatlonal  uniformity  and 
ten  oe  relaxed. j  Consider  the  seta 


:  n  a  A  -  S  /  'A)  for  all  \  r  7,  and 
s  i  a 


where  v^|A)  *  m^,  IA)  for  all  At  7,  v  -3  ;  *  ,  and  7 

is  a  2-elternating  capacity.  Rote  that'*'  la  weekly 

cos^ect  [191  and  convex.  The  following  result  is 
a  subcase  of  Theorem  L.l  of  f 191 . 

Lemma  3 ;  Let  v^  and  v^  be  e»  defined  to  (1.161- 

(1.17),  and  he  a  version  of  dv^,  iv^.  Then 


en>  ind 


(1.173 ,  acid 


:h«r«  exists  aaasure  a  -  **  such  chat  am  /dvVf 
s  s  -J  j  N 

and 

*  XJ>  *  v3('~0  K  X%1'  (l-18) 

for  all  x  t  '0,“).  Consequently  oecomes 
stocneaticelly  smeller  under  a“  over  . 

i  l  5 

Let  s^  be  the  derivative  of  a_  with  respect 

to  the  Lebetgue  measure  on  3  .  'text,  we  prove  the 
mein  theorem  of  this  section. 

Theorem  2.  ?or  m"  as  defined  by  Lama  3  lee 
L ,  i 

"j  t  um^  dv^,  in  particular  and  define 

•V  *  l.  Then  (\  ; *^.dg)  is  a  saddle- 

pome  solution  for  the  game  of  (1.3)  where  the 
classes  <7  and  1  are  dsfined  bv  *>•  [aim  t*'  and 

s  -  dm^/du],  ?!  *  {n0]  . 

Proof.  To  prove  chat  (h^;s^,nj)  la  a  saddle-potnc 

for  (1.8),  it  suffices  to  show  that  (1.10)  la  crue 
-see  Latmu  l).  In  this  case  (1.10)  reduces  to 

's  -  s  ,2h.  -h5-  a  0 

or  L  “  “ 

»(h^)sdu  a  >  (h.  }sTdu 


where  /(H^'-lh^-h^  and  h,  »(a,  -n.)  *  s.  -  -^(  1»  *  \ 

Equivalently  we  can  write 


d(t0)dmj  a  ,_:(-5)dm; 


where  p(_0)  *  1  -  ( 1  •*-’Tg)  1*  *n  increasing  func¬ 

tion  of  -g.  However,  Lesas  3  implies  thee 
becomes  a cochatcically  smaller  under  m"  than  eny 
other  mf  n  ■  This  completes  che  proof  of 
Theortm  2 . 

The  rats  function  s^  can  be  thought  as  a 
least-favorable  signal  for  minimax  matchad  filter¬ 
ing  Ve  can  actually  prove  the  following 
property. 


.inear  estimate 


Theorem  3  -  Tne  cseasure  .  satisfies  che 

-  one  .us ton  or  Lima  J  iff  i.  -  ;a"  i**  mni  mazes 

nin^  :  his.n^  •  i.  s»nr  ’  fc*  *'*“  s-nyou  .1.19' 


)v«r  ill  >  -•  *  . 
r  ^30 1  -  :  ■ 

*ar.  :  h;s,n 


•  *ty  *-ny  .  Then 
-.  1  -  f  *  i  <  m  -  .  . 


H.’.Ci  -  *  -C  '  •  ■  i  -  v. 

entiJole  on  3,1', 

t.:  :t  ir . 


_ s  convex  md  twice  iixfer- 
rheorem  J  :o iiuw*  from  Theorem 


Recall  ^\ic  .".  *  3,t;  ii  :ong>act  cor  iny 

finite  decision  time  interval  t.  Therefore  ill 
■Ji  important  classes  of  signal  uncertainty  art 
capacities  *aa  22 }  The  least-favoraole  sig¬ 
nal  s.  ii  man  raLacaa  3v  j.  *  •  ao^  aw  to  the 

least- favorable  its cri but ion  ?  of  the  equivalent 
-ooust  nypo thesis  testing  problem  of 

■  ,?  ?  •  “a  i  ,  a  -  ]  versus  0,.  •  ;u,  Ev,j 

i  i  i  3  3  5  J*  »  N  M 

n  i;  m  me  voice  -toiaa  cist)  .  The  least- 

tavoraol*  distributions  have  oaan  found  cor  the 
*: -contaminated,  total-venation  nelgnborhoods 
•ooaaLj  in  20]  ;  for  the  rand  model  in  ’21’*.  a  no 
for  me  extended  p-pomc  in  22]. 

Cl.  303UST  VIZ  NT R  "I LEERING 
Suppose  ina  crip le  r. ,  7  _  >  ^  consists  of 

i  iample  ipac*  .2,  ir.  increasing  family  ot*  r-fielcs 
7.  .  ,  mu  a  prooaoilicy  naasura  ?.  Lac  \\  oa 

*n  7^-acapceu  point  process  with  ;ung>s  if  size  j  ina 
1.  ina  compensator  *  *  is  see  ii]) 

proolem  of  Linear  '-!MSE  estlaetl on  of 
:o$ervations  N_  given  oy 

iS.  ■  „ic  -  ia„,  c  a  3 


lonslder  the 
• .  from  che 

2.1) 


itraighcforwsraly  ov 


a  .-IN 

i 


of 


is  given 


dUdV 


c ,  u  >du 


ifld  ic  rums  out  thee  cna  jorriiponoing  expression 
_ror  continuous  obsarvationa  would  be  cha  same  vitn 
■  ^  *'  u ,  •/ 1  replaced  oy  5C  (utv>.  Thera  fora  .2,1. 

can  oa  written  in  cha  equivalent  fora  equivalent 
for  linear  filter  design,  chat  is) 

ir  •  <  i t  ♦  ■  iv  ,  t  v  3  •  2 . 5  \ 

t  c  c  c 


vnara  r#  is  cha  continuous  oosarvacions  process 
and  v,  is  a  standard  zero  irift,  variance  c ' 

-Lenar  process .  This  result  Appears  in  12,  p. 

320 1  . 


For  being  vice-sense  scacionarv,  ve  have 
’■  „  ■  *  ,»s.  t  t,s  i  •  <.  i  c-i  >  and  2.;)  reduces  co 

ir "  •  *•  „  i t  -  ■ '  )  : aw „  •  _  i c  *  :w „  1  2  .  o  . 

vnara  is  white  laussian  noise  with  Ioud ie-sideo 
spectral  iansity  • 


Therefore  the  noncausal  vtaner  filtering 
error  is  ^iven  by  15,  ?.  -961  ^tiare  \  replaces 
'■•y  2.  In  particular  ve  have 


2 .  T> 


and  the  causal  Vtaner  filtering  *rror  15, p.5011, 
lb i  is  given  by 


•vnara  n  is  an  : .  *narcir.gale .  Note  that  **c;ta  )} 
mus  defined  is  cne  rata  of  the  observations . 


«  (?, 


-0' 


inf  1 


'  2  .  i ) 


The  solution  co  this  estimation  problem  is 
<novn  12,  pp.  317-320]  co  be 

•*  •  n.(,t,u)dN  (2. 2^ 

-•o’  u 

vnara  h^^t.u)  Is  cha  solution  to 

c,s)  •  hn<  t.u)*'^  (u,s>  -  •  :(u-«;]du,  (2.3) 

•  o  -  u 

'•  ^  *  i(‘  j,i  *ntl  i  •  £(>  "•  )  (>c  l«  ittuaad  co 

o*  tonn«4«clv*  | . 

Th«  «au*clon  corrsspondlns  co  (2.i)  for 
cooctnuou*  3b*#rv<clon*  would  b«  i  15,  pp.  198-201 ] 
cn«  list  II. 5)  but  wicn  ■  r«pl«c«d  by 

<.  u,n,  ch«  »ucocorr«l*ct on  of  ch«  nol>«  proc««»  . 
•tailirly  cli«  >B£  s..n  jqu4r«  «rror>  for  any 


wb«r«  r(ii)  Is  ch«  sp«ccr»l  KtulCy  s: 


tf  ch«  proctss  V_  io«s  noc  hav*  spaccral 

ianslcy  71.it)  but  has  4  spaccral  aaasura  a  chart 
(2.7)  oocooas  * 


*  (B,'V 


imt  -  aoN 


9) 


whart  da^  •  Vdut.  Kowavar  as  shown  In  [17J,  (2.8' 

rttaaJ.ru  cha  saaa  -vnara  now  cm)  Is  cha  darlvaclva 
of  cha  absolucaly  condnuou*  pare  of  a  -vich 
raspaec  co  cha  Labas?ua  aassura. 


Noca  chac,  In  all  of  cha  aoova  Jlscuaslon.  1c 
Is  aasuaad  chac  cha  sacond-ordar  charac carls ctes 
of  cha  raca  proeaaa  ;  c  2  )}  ara  an  own .  How- 

avar,  slnca  chasa  characcarlscics  ara  raraly  known 
axaccly,  va  wish  co  consldar  cha  fllcar  daslfn 
probla*  for  cha  caaa  In  which  cha  rsca  spaccrua  - 


\ 


-i  <nown  oniv  10  o*  vimin  s  convex  class  *  jf 
io«c::j  The  roousc  3iener  filtering  problem  men 
/echoes  ;i nd  :.nt  ievoreole  -  Jver  .  < 

: i nd  *  -  *’  solving 


men  resign  :m  ;pci3xio  : liter  for  :r.u  spectrum; 


-4  men  /id 


i  rooust  miniaaxi 


St  Therefore  mere  .s  *  *irect  i.nelogy 

-I  m  me  jjncinuoai  rose  rv,i  cions  ;ji<  vhen  me 
'oij<  res ir ic tea  :o  ;e  vnit«.  The  spectral 
T**aure  jr  me  noise  in  mis  case  is  i  Leoesgue 
neesure  over  -•,»'*  *nd  thus  the  Least- favorao le 
spectral  measure  for  me  signal  can  be  obtained 
for  capacity  classes  /l a  13'. 

If  me  spectral  density  is  bandlimited  then 
all  me  .uncertainty  classes  -mentioned  in  Section 
1. 3.3  ire  capacities;  jchervise  only  me  tend 
-nocel  gives  a  capectiy.  However,  in  the  non¬ 
causa  1  filtering  case  ina  for  an  < -contaminated 
signal  me  least-favonole  density  was  found  in 
S!  lollowi.ng  a  urect  ipproacn. 

:::.  urn^Ax  state  isttmation  tor  linear 

SYSTEIS  WITH  NOISE  TNCSRTAIN’TY 
In  this  section  we  m*a  ^  to  be 

:e fined  as  ioove  ana  tc  oe  an  ?_-*deptia  point 
process  with  jumps  )  or  l  ana  ccmpensecor  ^cs. 

:  :nsicer  tr.e  linear  stocnastic  system  cescrtbed  ov 
state  -3.  -X.  •  A  _  :t  -  3  _  :v_ ,  :*3  '2.D 

-t  serve  cions  so.  -  '  _-c  jX _  '0  :•*-<!  n_ ,  t  -  )  '2.3. 

*nere  »r.s3.  ire  matrices  3.  positive  lefmiti  ,  t. 


is  j  rector, 


u  a  .-ector  s tender  1  Wiener  orocess 


L.naependent  of  *1^,  me  covariance  of  X«,  is 
given,  IiX,  *0  ,  ana  n_  is  in  ?  ..-martingale .  The 
processes  ■/.  mo  a.  are  assumed  to  be  independent. 

It  _ 3  mown  in  12  ,p.  331  that  the  Kalman 
filtering  acuations  tor  me  jvscem  :e  scribed  in 
3.1  -'3.3'  are 

-:<2  ■  a./.*  •  I.o  *lf  :N  -  -  it  -  '3.3a 


a  -  * r  v  - 3.3: 


which  are  the  same  with  those  jf  a  conclnuous- 
joservetions  system  with  *  p  as  the  covariance  jf  a 

Wiener  process  type  noise.  Also,  the  performance 
of  my  lineer  filter  is  predicted  by  the  continuous- 
ooserva cions  analog  implied  by  (3.3).  In  other 
words  art  eauivelenc  con  cinuous-ob  serve  cions  model 
to  3.3  if 


ir .  -  *  .  *  o  IX.  dc  «•  O  m)  7  ivr  O.-** 

vnere  v_  is  a  standard  Wiener  process. 

If.  in  addition,  in  the  above  equations 
' <  3 , i 3 .2 ) i  we  assume  that  Ap  •  a,  3.  »  3, 

and  c_  •  c  ia  .  v#  nave  a  cime-inveriant 
ivstemi,  then  ana  ■  are  aide-sense  stationary 
ana  •  '■  as  m  Section  II.  In  this  casa,  we  can 

*se  the  noaei  of  ■  2.-*  to  apply  some  of  the 
results  of  10 1  <  again  33]  and  [31  Jill  be  help¬ 
ful  as  for  iection  II;  directly  to  the  problem  of 
minimax  mean-square -error  estimation  of  the  state 
in  *3.1)  for  situations  la  which  there  is  un¬ 
certainty  concerning  the  noise  structure  (i.e., 
uncertainty  In  3  or  in  the  covariance  matrix  of 
[v^.c  i  o] ) .  In  particular,  such  probleme  are 

solved  by  the  Kalman  filters  corresponding  to 
certain  least- favorable  noise  models  that  are 
characterized  in  (10). 

Remark  1.  The  model  of  the  race  as  *  •  *c’X, 
can  oe  interpreted  as  '■  being  the  carrier  and  c’X^.  — 

containing  the  information.  Thus  the  useful 


information  (assume  £(X 


0)  is  in  the  variation 


of  :t_,  while  the  pwer  of  the  "noise"  depends 


Remark  3.  If  ;3.i)  ("being  dX.  •  AXcit  -  3dv„  in 

the  stationery  case;  allows  X.  to  take  on  both 

negative  ana  positive  values  then  **p«V.-c’X- 

becomes  negative  for  some  range  of  X  and  t. 

c 

How.v.r  in  sicuaclon*  if  practical  inttrtst  ;Ms 
napp.na  visa  low  prooabilicy.  Tha  axac c  analyaia 
of  :ni*  ilcuaclon  remain*  an  opan  problaa  saa 
>i  1  for  a  dlicuuion  of  ;his  problaa). 
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Abstract 

The  problem  of  linear-quadratic-Gaussian  control  of  multivariable  linear 
stochastic  systems  with  uncertain  second-order  statistical  properties  is 
considered.  Uncertainty  is  modeled  by  allowing  process  and  observation 
noise  spectral  density  matrices  to  vary  arbitrarily  within  given  classes, 
and  a  minimax  control  formulation  is  applied  to  the  quadratic  objective 
functional.  General  theorems  proving  the  existence  and  characterization  of 
saddle-poinc  solutions  to  this  problem  are  presented,  and  the  relationship 
of  these  results  to  earlier  results  on  minimax  state  estimation  are 
discussed.  To  illustrate  the  analytical  results,  the  specific  example  of 
regulating  a  double-integrator  plant  is  treated  in  detail. 
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1.  Introduction 

Ihe  design  of  optimum  decision  and  control  procedures  for  a  linear 
stochastic  system  requires  an  accurate  description  of  the  statistical 
behavior  of  the  system.  However,  because  of  nonideal  effects  such  as 
nonstationarity ,  nonlinearity,  and  other  modeling  inaccuracies,  there  is 
always  a  degree  of  uncertainty  in  such  statistical  descriptions.  A  useful 
approach  to  design  in  the  presence  of  small  modeling  inaccuracies  is  to 
use  a  game-theoretic  formulation  in  which  one  optimizes  worst-case 
performance,  and  this  approach  has  been  applied  successfully  to  many 
aspects  of  decision  and  control  system  design  (see,  for  example,  Huber  [1] 
and  Mincz  [2]).  In  a  recent  paper  [3],  two  of  the  authors  have  applied 
this  approach  in  considering  the  problem  of  designing  linear  minimax-mean- 
square-error  state  estimators  for  linear  systems  observed  in  and  driven  by 
noise  processes  with  uncertain  second-order  statistics.  In  particular, 
it  is  shown  in  [3]  that,  for  two  general  formulations,  such  estimators  can 
often  be  designed  by  designing  linear  minimum-mean-square-error  filters 
for  least-favorable  pairs  of  noise  spectra  or  covariance  matrices.  Related 
minimax  state  estimation  results  are  found  in  a  paper  by  Morris  [i] . 

In  this  paper,  we  consider  the  analogous  problem  of  minimax  linear- 
quadracic-Gaussian  control  (LQG)  of  systems  with  uncertain  second-order 
statistics.  In  particular,  we  consider  the  control  of  linear  multivariable 
systems  with  white  Gaussian  process  and  observation  noises  with  uncertain 
spectral  density  matrices.  It  is  3hown  here  that,  within  mild  conditions, 
this  problem  can  be  solved  by  designing  an  optimal  control  for  a  least- 
favorable  modal,  although  the  model  which  is  least-favorable  for  control 
may  not  be  the  same  as  that  which  is  least-favorable  for  state  estimation 
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for  che  same  type  of  noise  uncertainty.  However,  it  is  also  shown  chat,  for 
uncertainty  in  eitherthe  process  or  observation  noise  only,  a  given  minimax 
li.oear-quadratic-Gaussian  control  problem  does  have  the  same  least-favorable 
mocel  as  does  a  particular  minimax  state  estimation  problem  with  a  weighted- 
mean-square-error  criterion.  Thus,  as  might  be  expected,  a  limited  duality 
exists  between  these  two  problems.  Another  phenomenon  which  is  shown  to  be 
associated  with  minimax  cont.ol  is  that  Che  separation  principle  which 
separates  the  problems  of  optimal  control  and  opcimal  state  estimation  is 
not  necessarily  valid  for  minimax  control  and  minimax  state  estimation.  In 
particular,  it  is  shown  chac,  although  the  miniaax  control  law  i3  independent 
of  the  minimax  3tate  estimator,  the  reverse  is  noc  true.  Several  ocher  aspects 
of  this  problem  are  also  considered  in  this  paper. 

In  Section  2,  the  specific  problem  formulation  to  be  considered  Is  given, 
and  several  relevant  properties  of  optimal  LQG  control  are  outlined.  The 
general  minimax  problem  is  treated  in  Section  3,  and  results  giving  conditions 
for  the  existence  and  characterization  of  saddle-point  solutions  are  derived. 
Section  4  includes  a  discussion  of  several  interesting  properties  associated 
with  che  general  miniaax  results  of  Section  3,  and  the  specific  example  of 
controlling  a  double-integrator  plant  with  uncertain  process  noise  statistics 
is  considered  in  detail  in  Section  5. 
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Preliminaries 

Consider  che  linear  time  invariant  stochastic  system 


x  *  Ax  +  Bu 
t  t  t 


'  t 


(1) 


y.  -  Cxc  -  9C  (2) 

vhere  and  are  in  F.n,  u^  is  in  ]Rm,  and  y ^  and  3^  are  in  for  each  t. 

The  matrices  A,  3  and  C  are  assumed  to  have  compatible  dimensions  (as  required 
by  (l)-(2)>  with  tne  pairs  (A, 3)  and  (A,C)  stabilizable  and  detectable 
respectively.  The  noise  processes  5^  and  3C  are  assumed  to  be  zero  mean  white 
Caussian  processes  with  second  order  statistics 


E?c  5*  -  2  5  (t-s)  (3) 

E  et9*  -  9  0  (t-s) 


where  '  is  the  Dirac  impulse.  It  is  assumed  that  (A.^/S  )  is 
chat  3  >  0.  The  objective  of  the  problem  is  to  choose  ufc  to 
average  quadratic  cost 


stabilizable  and 
minimize  the  time 


J  » 


W 


where  Q  a  0  with  (A.VQ  )  detectable  and  R  >  0. 

when  »  and  3  are  known,  the  solution  to  the  stochastic  regulator  problem 
(l)-(4)  is  given  by  the  feedback  system: 

u_  -  -Cxc  (5) 


x£  ■  Ax£ +But -*-H^yc-Cxc^ 


(6) 


The  matrices  K  and  P  are  the  unique  positive  semi-definite  stabilizing  solutions  to 
(3)  and  (10)  respectively. 

As  discussed  in  Section  1,  the  second  order  statistics  for  the  processes 
* _  and  3^  are  often  not  known  precisely.  A  common  representation  of  this  type 
of  uncertainty  is  to  assume  that  5  and  3  are  contained  in  compact  sets  2  and  7? 
respectively.  The  objective  is  then  to  choose  u^  to  minimize  the  worst  possible 
performance  (i)  given  (2,3)  32x7?.  We  will  restrict  our  consideration  to 
controls  generated  by  causal  appropriately  measurable1  functions  of  the 
measurement.  Denote  this  class  of  operators  as  2g  .  The  problem  can  then  be 
stated  as  the  minimax  problem: 

min  max  J(L,Z,3)  (11) 

L  :  I  (H,3)€  2x7? 

s 

where  the  dependence  of  J  defined  by  (l)-(4)  on  L,  2,  and  0  has  been 

explicitly  noted.  Note  that  the  optimal  linear  feedback  law  defined  by 

(5)-(10)  is  a  member  of  Z~  . 

s 


^See,  for  example,  Chapter  16  of  [7]  for  the  explicit  measurability 
conditions . 
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3 .  Existence  and  Characterization  of  a  Saddlepoint 

Two  important  results  concerning  solutions  to  the  minimax  problem 
formulated  in  section  2  are  presented  in  this  section.  The  first  result 
establishes  an  equivalence  between  a  saddlepoint  solution  to  (11)  and  an 
optimal  stochastic  regulator  solution  (5)- (10)  corresponding  to  a  particular 
(2,3)  pair.  The  second  result  establishes  the  existence  of  a  saddlepoint 
when  the  secs  Z  and  7?  are  convex. 

To  obtain  these  results,  we  will  need  the  following  well-known 
cheorem  (cf.  [5])  which  establishes  the  fact  chat  the  existence  of  a 
saddlepoint  is  a  necessary  and  sufficient  condition  for  the  minimax  problem 
(11)  to  be  equivalent  to  the  corresponding  maximin  problem 

max  min  J(L,H,3)  (12) 

(S,9)€2xr 

Theorem  1;  There  exists  a  triplet  (Lo,2o,3q)  €  x  Z  x  V.  satisfying  the 
saddlepoint  condition 

J(L  .3,8)  s  J(L  ,H  ,3  )  S  J(L,2  ,8  )  (13) 

0  OOO  00 

n«i+,  3  €  2,  8  €  n 
3 

if  and  only  if  the  values  of  (11)  and  (12)  are  equal. 

We  will  also  require  the  following  lemma  which  expresses  the  cost  for 
any  2  and  3  when  the  control  is  generated  by  (5)- (8)  with  H  being  any 
matrix  such  chat  (A-HC)  is  asymptotically  stable. 

Lemma  1 :  Assume  that  the  control  u£  is  generated  by  the  system  (5)-(6) 
with  feedback  gain  G  determined  by  (7)- (8),  and  that  H  is  any  matrix  such 
chat  all  eigenvalues  of  (A-HC)  have  negative  real  parts.  Then  the  cost 


J  defined  by  (l)-(8)  is: 


J  *  tr  i  K  +  tr(i  t  H3HT)X  (14) 

where  K  is  given  by  (8)  and  X  is  the  unique  positive  semi-definite  solution 
of 

(A-HC)TX  +  X(A-HC)  +  GTRG  *  0  (15) 

Proof:  a  proof  is  provided  in  the  Appendix^. 

Theorem  2  provides  the  desired  characterization  of  a  saddlepoint. 

Theorem  2:  Assume  there  exists  3  €  Z  and  9  S  ??  which  satisfy 

_______  oo 


tr'2  Y}  s  tr(SQ  Y} 

Y  2  €  X 

(16) 

tr'SH  XH1}  *  tr{9  H  XHTj 
0  0  0  0  0 

Tsen 

(17) 

where  H  is  the  Kalman  filter  gain  corresponding  to  2  and  9  (given  by 
0  0  0 

(9)-(10)),  X  is  given  by  (15),  Y  is  the  solution  to 

(A-H  C)TY  +  Y (A-H  C)  +  0  +  KH  C  +  CTHTK  -  0  (18) 

O  O  x  0  o 

and  G  and  K  are  given  by  (7) -(8).  Let  Lq  be  the  operator  representing 

the  optimal  stochastic  regulator  ( 5) - (6 )  corresponding  to  *q  and  0Q.  Then 

(L  ,2  ,9  )  is  a  saddlepoint  solution  to  (11). 
ooo 

Conversely,  assume  that  (LQ,2o,90)  is  a  saddlepoint  for  (11).  Then  Lo 
is  the  LQG  regulator  (5)-(10)  and  (Sq,90)  satisfy  (15)-(18). 

*“This  result  may  also  be  developed  using  the  results  on  pp.  185-186  of  [8]. 
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Proof:  (Sufficiency)  Consider  Che  maximin  problem  (12).  Lee  3q  and  3q 

satisfy  (15)- (18)  and  let  Lq  be  Che  corresponding  optimal  stochastic 
regulator .  Let  Hq  be  Che  Kalman  gain  for  Sq  and  3^  given  by  (9)-  (10) .  Then,  by  lemma  1 

J(L  ,3,3)  -  trf 3  (X  +  K) }  +  tr{H@HTx}  (19) 


for  every  (3,3)  € Z x  Adding  (15)  and  (8)  gives: 

(A-H  C)T(X  +  K)  +  (X  +  K)  (A-H  C)  +  Q  +  KH  C  4-  CTHTK  =*  0  (20) 

0  0  0  0 

Hence 


Y  -  X  +  K 


(21) 


Also,  by  (16) 


tr[3(X+K)}  S  tr[ Sq (X  +  K)  ] 


(22) 


Adding  (22)  and  (17),  and  using  (19)  gives  the  lower  inequality  of  (13) 


J(Lo,3,3)  s  J(Lo,So,3o)  Y3€Z,9€7l  (23) 

The  upper  inequality  of  the  saddlepolnt  condition  (13)  follows  trivially 
from  the  fact  that  Lq  is  the  optimal  stochastic  regulator.  Thus, 

(1.^,3  ,3q)  is  a  saddlepoint  for  (11). 

(Necessity)  Suppose  (L0i2o>3o)  satisfies  (13).  The  upper  inequality  of 
(13)  implies  that  Lq  is  the  optimal  stochastic  regulator  (for  which  one 
realization  is  (5)* (10)).  Hence  lemma  1  can  be  used  to  express  the  cost. 
The  lower  Inequality  and  lemma  1  imply: 
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tr'  2  K]  *  trf  (2  +  H  3HT)X;  £  trf  2  !<}  *  tr<  (2  3  HX)X^  (24) 

0  0  '  0  "0  0  0  0 

for  every  2  i  Z  and  9  3  7J.  3y  (21)  this  can  be  written  as 

tr "  (3-2  )Y}  +  trf  (3  -  3  )HT  XH  }  s  0  7  232,9  3?? 

•  o  o'  o  o 

In  particular,  9  ■  9q  gives  (16)  and  S  *  2q  gives  (17). 

Thus,  we  see  chat  conditions  (15) -(13)  are  equivalent  to  the  existence 

of  a  saddlepoint.  If  such  a  saddlepoint  exists,  then  the  minimax  controller 

is  simply  the  optimal  stochastic  regulator  designed  for  the  particular 

(2  ,9  )  pair  which  satisfies  (15)- (18).  This  result  can  be  used  to 
0  0 

establish  the  existence  of  a  saddlepoint. 

Theorem  3 :  Assume  2  and  7?  are  convex,  compact  sets  such  that  if  2  €  Z 
then  2  i  0  and  (A,v2)  is  stabilizable  and  if  9  €  7?  then  9  >  0.  Then  a 
saddlepoint  solution  for  the  minimax  problem  (11)  exists. 

Proof :  The  proof  shows  that  a  solution  to  the  maximin  problem  (12)  exists 
and  satisfies  conditions  (15)-(13)  of  Theorem  2. 

By  Lemma  1,  and  equations  (7)- (10)  and  (15), 

min  J(L,2,9)  *  trXK  +  tr(2  +  H®5T)X 

(26) 

«  M(2,9) 

is  continuous  in  2  and  9  (with  H  given  by  (9)- (10)  for  each  I  and  9) .  Since2  and  7J  are 

,2  ,9  )  be  such  a  solution. 


(25) 


compact,  a  solution  to  (12)  exists.  Lac  (L 
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Then  che  Frechec  differencial  of  (26)  with  respecc  co  2  and  9  ac(5  ,9  ) 

o  o 

trnsc  be  nonposicive  in  every  direction  inco  che  sec  Z  *  T[.  The  Frechec 
differencial  of  (26)  is  given  by: 

;M(2,3;A2,A®)  -  trtA2(K  +  X)] 

+  tr{A®HTXH} 

.  .t  -  (27) 

+  tr[9(5H  XH  +  H  X6H)} 

*  cr{  (2  +  H  9  HT)6X} 

In  (27),  5H  and  :X  represent  che  Frechet  differentials  of  H  and  X  with 
respecc  to  2  and  3.  From  (15),  5X  can  be  computed  as  the  solution  of 

(A-HC)T5X  +  5X(A-HC)  -  Cr5HTX  -  X5HC  -  0  (28) 


Thus,  5X  is  given  by: 

5X  » 


e  (A-HC)  tfcT5-Tx  +  x6jjC]e  (A'^C)tdt 


(29) 


Substituting  (29)  into  (27)  and  using  a  few  trace  manipulations  gives 


5M(2,3;£2,AS)  -  crj>3(X  +  K)}  +  tr{A©HTXH} 


-  tr{ ^  e(A"HC)C(2  +  H3  HT)e(A‘HC)  Cdt  [CT5HTX  +  X5HC] } 
+  trf PCT5 fiTX  +  PX5HC] 


(30) 


But  che  integral  in  the  third  term  of  (30)  is  Che  solution  to  (10);  i.e., 
P.  Hence 


aM(2,3;2S,A9)  -  tr[&S(X  +  K)}  +  tr{i®HTXH} 


(31) 


Consider  an  arbitrary  point  (2,9)  €  Z  x  71.  Since  Z  and  "?  are  convex, 

the  line  segment  joining  (I  ,9  )  and  (2,9)  is  in  Z  x  71  and  hence 

0  0 
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(A2.A3)  -  (2  -  S  ,3  -  3  )  (32) 

o  o 

is  a  direction  into  Z  x  *J.  Substituting  (32)  into  (31),  requiring  (31)  to 
be  nonpositive  and  using  (21)  gives; 

tr*  (2  -  2  )Y}  +  tr{  (3  -  3  )HT  X  H  }  s  0  (33) 

The  choice  (2,3)  •  (2,3q)  in  (33)  gives  (16)  while  the  choice  (2,3)  * 

(2  ,3)  in  (33)  gives  (17).  Thus,  by  Theorem  2,  (Sq  ,3q)  is  a  saddlepoint 
for  (23). 

□ 


This  section  has  provided  two  major  results.  First,  every  saddlepoint 
solution  to  the  minimax  problem  formulated  in  section  2  has  been  character¬ 
ised  by  che  conditions  of  theorem  2.  In  addition  to  providing  a  means  of 
identifying  a  particular  solution,  these  conditions  can  be  used  to 
characterize  the  set  of  possibLe  solutions.  This  subject  will  be  addressed 
further  in  the  next  section.  Theorem  3  provides  the  second  important  result 
by  demonstrating  the  existence  of  a  saddlepoint  solution  to  the  minimax 
problem  when  the  sets  Z  and  Z.  are  convex  and  compact. 


There  are  several  interesting  observations  which  can  be  made  concerning 

the  results  of  the  previous  section,  first  we  note  that,  since  (6),  (9), 

and  (10)  give  the  linear  least-squares  state  estimator  for  a  fixed  (3,2) 

pair,  the  optimal  linear  regulator  problem  for  fixed  (0,2)  is  solved  by 

feeding  back  optimal  state  estimates  through  Che  gain  G  (which  does  not 

depend  on  (3,2)).  Thus,  as  is  well  known,  there  i3  a  separation  between 

the  estimator  and  regulator  design  problems  in  the  case  of  fixed  (0,2). 

However,  it  follows  from  Theorem  2,  (16),  and  (17)  that  such  a  separation 

does  not  generally  exist  in  Che  minimax  problem.  In  particular  we  see  from 

Theorem  2  chat,  although  the  feedback  gain  does  not  depend  on  (3,2),  the 

state  estimates  used  for  minimax  control  are  not  generally  the  minimax- 

mean-square-error  state  estimates.  This  follows  because  the  equations 

determining  the  least-favorable  pair  for  control  depend  directly  on  the 

cost  matrices  Q  and  X,  which  of  course  have  no  effect  on  which  pair  is 

least-favorable  for  state  estimation  (as  in  [3]). 

The  above  observation  also  implies  that  the  (3,2)  pair  which  is 

least  favorable  for  control  is  not  necessarily  the  same  as  that  which  is 

least  favorable  for  state  estimation.  However,  the  conditions  that  Theorem 

2  requires  for  minimax  control  are  similar  in  structure  to  conditions 

required  by  Theorem  5  of  [3]  for  minimax  state  estimation.  Using  the 

similarity  it  follows  that,  for  fixed  2,  the  Kalman  filter  corresponding 

to  (0  ,2)  where  3  is  from  (17)  also  solves  the  minimax  state  estimation 
o  o 

problem 

min  max  E‘f  (x  -  x  )T  GTRG  (x  -  x  )}  (34) 

'x  c  c  00  1  c 

where  G  is  the  regulator  feedback  gain  from  (7).  a  similar  statement 
o 


applies  if  3  is  fixed  and  3  is  unknown;  however  if  both  3  and  3'  are  unknown, 
there  generally  is  noc  a  single  minimax-mean-squara-error  state  estimation 
problem  which  has  the  same  least  favorable  pair  as  (5). 

A  maximal  element  3^  for  the  set  £  is  one  which  satisfies  3  s  3^  for 
all  3  i  Z,  where  3  £  3^  means  that  (3  -3)  is  nonnegative  definite.  It 
was  noted  in  [3]  that  if  Z  or  "7  has  a  maximal  element  then  that  element 
is  least  favorable  for  state  estimation.  3y  inspection  of  (16)  and  (17) 
we  see  that  maximal  elements,  when  they  exist,  are  also  least  favorable 
for  the  regulator  problem. 

Example :  Assume  that  the  process  noise  can  be  written  as 

»  m  n? 

»t  -t 

where  I?  is  a  zero  mean  white  Gaussian  process  with 

w 

ECfcSj  =*  ii(t-) 

and  where  (A,D)  is  a  stabilizable  pair.  A  common  mechod  of  modeling  uncertainty 
in  the  second  order  a  priori  statistics  of  system  (l)-(2)  while  preserving  the 

A 

input  structure  of  the  process  noise  is  to  choose  a  nominal  pair  (2^,3^)  and 

A 

assume  that  the  true  3  and  3  differ  from  the  nominal  in  norm  by  no  more  than 
some  positive  constant  Define 


A  CD  3  DT:  i 


-  I,,  ^  "} 


n  a  [3:  ,!»  -  3n,;  < 

where  •  denotes  the  norm  induced  by  the  Euclidean  vector  norm  on  the 

^  n 

underlying  space  (1R  and  Ik  r  respectively)  and  where 


-  > 


Then  each  set  has  a  maximal  element 
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2o  *  +  "I)dT 

9  »  a  +  "T 

o  N 

and,  by  che  above  discussion  che  minimax  concroller  is  che  LQG  concroller 

designed  for  2  *  2  and  9=3.  — 

o  o  — 

Of  course,  most  uncercaincy  classes  of  incerest  will  noc  concain  maximal 
elemancs;  however,  for  any  compacc  classes  Z  and  7?  che  sec  of  possible  lease- 
favorable  pairs  can  be  reduced  co  only  chose  on  che  upper  boundaries  of  Z 
and  77,  where  che  upper  boundary  [6]  of  Z  is  che  sec  of  2  sacisfying 
■*F  “  Z\  F  Ss  h]  =  (s} .  (Note  chac  che  upper  boundary  of  a  sec  wich  a  maximal 
elemenc  is  jusc  chac  maximal  elemenc.)  Furthermore,  noce  chac  che  condicions 
(16)  and  (17)  are  sacisfied  for  all  (2,3)  pairs  in  Z  x  77  if  chey  are  sacisfied 
for  all  pairs  on  che  upper  boundaries  of  Z  and  "7.  Thus,  by  Theorem  2  che 
minimax  problem  on  Z  *  71  can  be  equivalencly  defined  an  che  upper  boundaries 
of  Z  and  V .  Moreover,  che  requirement  chac  Z-  and  7?  be  convex  can  be  relaxed 
co  che  requirement  chac  their  upper  boundaries  are  also  che  upper  boundaries 
of  cheir  respective  closed  convex  hulls.  We  may  summarize  these  observacions 
in  che  following  theorem: 

Theorem  Suppose  the  following  cond. cions  hold: 

(i)  Z  and  77  concain  cheir  resoeccive  upper  boundaries  3*,  and 

/<#  “ 

(ii)  3^,  and  3^  are  che  upper  boundaries  of  cheir  respective  closed 
convex  hulls  co(J3^)  and  co(„2^). 

(iii)  7  2  €  co(/*J,  (A^/2)  is  a  scabilizable  pair. 

A. 

Then: 

min  max  J(L,H,3) 

l (35) 

»  min  max  J(L,2,3). 

(2, 9)63* 

3  Am 


N’oce  chat  Theorem  i  implies  that  'he  compactness  and  convexity  conditions 
required  for  existence  in  Theorem  3  need  only  apply  on  and  near  the  upper 
boundaries  of  X  and  "7 . 
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5 .  Examo  le 

To  iliuscrate  Che  results  of  che  above  sections  we  consider  the 
following  example  which  corresponds  Co  concrol  of  a  doub le-incegracor  plane: 


0  i' 


o' 


c  -  [1  0]  ; 


3 


q 


2J 


R  ■  1 ;  and  Q  *  t 

_Q  Oj 


(36) 


where  r"  >  0.  Note  Chat  any  compact  sec  of  3 1 s  has  a  maximal  element  in 

2 

this  case  given  by  che  maximum  value  of  r”  in  che  sec.  Thus,  no  significant 

2 

generality  is  last  by  assuming  Chat  r“  is  fixed  at  sup  For  fixed  q^,q, , 

and  ,  the  matrix  ?  solving  (10)  is  given  by 


I  (37) 

/  “  . _ (  ~~  "  I 

./q:(2n,q;  *  qL)  •  q3j 


*  :ua  1  s 


(381 


(39) 


*  xaj  / 


;  where 


2^2(2z^'q2  +  qL)  +  r  +  2^  q2 
2*/2Ev  qj  * 

r"  +  6v  q^  r  +  2qL  -  2(r  +  v  -  (2 iv/q,'  +  qx 
2v'q2  V'2rvq2  +  qL 


A  convex  uncertainty  class  for  3  is  equivalent  to  a  convex  subset  of 

j  2 

. q  c  R  !  q^  a  0,  q,  i  0,  and  q^q2  2  q^}  •  Equation  (16)  and  the  fact  Chat 
Y  is  diagonal  imply  that  Eq  i  Z  is  least  favorable  iff. 

(q2(0)  -  q2)y2'0)  +  (q{0)  -  q^y^  2  0;  Y  3  €  Z,  (42) 


where  superscripts  'o’  denote  quantities  corresponding  to  5q.  It  follows 
from  (14),  (15),  (33),  and  (42)  chat  the  state  estimation  filter  structure, 
the  control  gain,  and  the  control  cost  are  all  independent  of  q^.  Thus  we 
can  set  q^  ■  0  without  loss  of  generality.  (Note  that  tr(P)  does  depend  on 
q 2 • )  To  illustrate  che  solution  to  (42;,  we  consider  the  two  uncertainty 
classes  (q^  *  0) 


Ej  q  >  0 ,  q  >  0 ,  and  aaxf  j  q  -  q . 


„  „  (N)  i  q  , 


ilq  >  0,qo  >  0,  and  |  q 


_  __  \  *  *  /  1  ,  ■  _  _  ' 
1  ql  1  |q2  q2 


o 


0 
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represents  a  nominal  model  for  the  state  noise  and  s  is  a  fixed  degree  of 
uncertainty  in  the  model.  Note  that  Z ^  has  a  maximal  element 


00 

1 

0 


+  s 


-t* 


which  thus  yields  a  minimax  design  immediately  for  this  case.  The  class 
Z ,  does  not  have  a  maximal  element  for  t  >  0,  but  Theorem  3  allows  us  to 
restrict  consideration  to  the  upper  boundary  of  Z ^  given  by 


“3 


C 


(45) 


Thus  (42)  reduces  in  this  case  to 


(q2(0)  -  cl2)(y20)  '  yl3))  *  0;  1  q2  S  f^2N),q2N)  +  s]l 


(46) 


The  Least  favorable  case  is  thus  q£°^  *  q|^  and  4°^ 


q2N)  + 


y  2  y  f0'  at  this  point;  it  is  qj0"* 
1  l  1 


00  .  (o) 

q1  +  e  and  q. 


u 

00 


y['>)  £  y,*'0'*  at  this  point;  otherwise  the  solution  must  be  a  point  on  2v 

/  L 


satisfying  y 


Co) 

1 


,(o) 


L8 


appendix 


Proof  of  Lemma  I:  Define 


>  =  x  -  x 

c  c  C 


rhea,  corr.bir.ing  (D,  (I),  (5),  and  (6)  with  (Al)  : 


d r :i;]  rA_3G 

3G  1 

r  xti 

1 — 

u 

*•« 

L 

d  t  ■  ;  n 

e  0 

i 

A-HC 

e 

i  -  ns 

r  ! 

j 

L  c- 

L"c  CJ 

Since  (A-HC)  is  stable,  che  processes  x  and  er  are  ergodic .  Hence  the 
can  be  written  as 

J (1,3,3)  3  lia  E(x"  Qx^  -  Ruc) 

r  -•  <a 

Subs  ti  rating  (Al)  and  \5)  in  (A3)  gives: 

J  »  lira  E  [x“  (Q  -  G'RGJx^-x^  G'RGe^  -  e^G^RGx^  +  e^G  RGe^} 

c  -  « 

Pse  of  a  simple  crace  idenciey  converts  (A4)  into: 


Define 


lim  cr 

■  a 

r  Q  *  G"RG 

! 

i  -G‘RC 


Q-GTRG  -GTRG  ]  Jfxe|xj  Je] 
G 


-G  ‘"KG 


1  E  <  “ 

sc  j  U*tJ 


_T  - 
-G  RG  I 

T  1 
G  RG 


^  -  -  -  - 
lim  E  <  |  ^  !  1  *C  |  \ 

--  U*J  i-e=J  J 


A  -  3G  BG 


“I 


A-HC 


3  1 

2  HSHT  j 


(Al) 


(A2) 


cos  c 

(A3) 


(A4) 


(A3) 
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Then  (Ai)  becomes 


J  »  tr  Q  Z 

where,  in  view  of  (A2) ,  Z  is  che  unique  solution  of: 


A  Z  +E  AT  +  X  =  0 

5ince  A  is  stable,  Z  can  also  be  'written  as: 

— T 

»  AC  AC 
Z  *  **  e  X  e  dt 

“  0 


Hence,  (A6)  becomes 


«  At  _  At 

J  ■  tr  Q  ”  e  X  e  dc 

“  0 

With  a  few  manipulations,  (A9)  can  be  rewritten  as 

— T  — 

_  •  At  _  At 

J  *  cr  X  "  e  Q  e  dt 

“0 


;erine 


then 


where 


Lee 


-  T 

_  •  AC  _  At 

X  *  ^  e  Q  e  dt 
“0 


J  *  Cr  X  K 


AT  K  *  K  A  +  Q  =  0 


K1  *2 

K*  X 


(A6) 

(A7) 

(A  8) 

(A9) 

(A10) 

(All) 

(A  12) 

(A  13) 
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Using  the  definitions  of  A,  K  and  Q  leads  to  three  equations: 


(A  -  30^  +  K1(A  -  3G)  +GTRG  +  Q  -  0  (A  14) 

(A  -  3G )  TK?  +  it,  (A  -  HC)  +  K13G  -  GTRG  *■  0  (A15  ) 

(A -HC)rx  +  X(A -HC)  +GTRG  ^  tC^BG-t- GT3TK,  =■  0  (Al6) 

Combining  (a14)  with  (3)  gives 

K  -  Kx  (A17) 

Combining  (Al7) ,  (A 15 )  and  (7)  gives 


^2  -  ° 


(A13) 


Hence , 


r 


(A19) 


where  X  (by  virtue  of  (A16)  and  (Al7)  is  given  by  (15) •  Finally,  substituting 
(A19)  into  (Al2)  and  using  the  definition  of  X  yields  (14). 
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MINIMAX  LINEAR  SMOOTHING  FOR  CAPACITIES 
Summary.  Minimax  linear  smoothers  are  considered  for  the  problem  of 
estimating  a  homogeneous  signal  field  in  an  additive  orthogonal  noise 
field.  A  minimax  game  with  the  quadratic-mean  estimation  error  as  an 
objective  function  is  used  to  formulate  this  problem.  Uncertainty  in 
signal  and  noise  field  spectra  is  modeled  using  general  nonparametric 
classes  of  measures  proposed  by  Huber  and  Strassen  for  the  problem  of 
minimax  hypothesis  testing.  These  classes,  which  are  described  in  terms  of 
Choquet  alternating  capacities  of  order  2,  include  the  conventional  models 
for  spectral  uncertainty  and  admit  a  general  solution  to  the  minimax  linear 
smoothing  problem. 
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L.  Introduction.  Suppose  we  observe  the  random  field  [Yz; z  S  Rn]  given 
for  each  z  5  Kn  by  Yz  *  (Sz  +  N  )  where  !_  S  z ;  z  €  ]Rn}  and  { Nz ;  z  6  Rn]  are 
orthogonal  random  fields,  each  of  which  is  second  order,  homogeneous,  and 
quadratic-mean  continuous.  Suppose  further  that  h  is  a  comp lex- valued  Borel- 
measurable  function  on  Rn,  and  that  Sz  denotes  that  the  linear  estimate  of 
S,  based  on  <*Y  ;  z  €  Rn]  which  has  transfer  function  h.  Then  the  quadratic- 
mean  estimation  error  associated  with  Sz  is  given  by 


Ef  |  S  -S  |2}  -  <2tt)“  V  |  l-h|  2dm, 

2  “E 


s  '  daM]  “  e(h;°S*V 


where  m ^  andrr^  are  the  spectral  measures  on  CRn  associated  (via 

Bochner’s  theorem  [1,  p.  245])  with  {S  ;  z  6  Rn]  and  f N  ;  z  €  Rn}  , 

z  z 

respectively.  For  fixed  ms  and  m^,  the  minimum  possible  value  of  e^jmg.m^) 
is  achieved  by  the  estimate  with  transfer  function  fi  *  dm^/dCm^+m^)  and 

lm^.  If,  on  the  other  hand. 


(1) 


this  minimum  value  is  given  by  (2rr)  n [  h  dn  ^ 


m  and  ra.  are  known  only  to  be  in  classes  77L  and  771 ,  respectively,  of 

j  ki  o  iN 

spectral  measures  on  (E  ,  3°),  then  a  reasonable  design  strategy  is  to  find 

a  linear  estimate  whose  transfer  function  minimizes  sup  e(h;m  ,nO. 

%  X  %  5 

Such  an  estimate  will  be  a  minimax  linear  smoother  for  771^  and  Certain 

aspects  of  this  problem  have  been  considered  by  Kassam  and  Lim  [2]  and  by 
the  author  [3].  In  this  paper  we  consider  the  minimax  linear  smoothing 
problem  for  the  situation  in  which  the  measure  classes  771$  and  ^  are  of  the 
type  generated  by  2-altemating  capacities  as  considered  by  Huber  and 
Strassen  [4]  in  the  context  of  minimax  hypothesis  testing.  Examples  of  this 
type  of  class  include  mixtures,  Prohorov  and  Kolmogorov  (variational) 
neighborhoods,  and  other  previously  considered  models  for  spectral  uncertainty. 


Note  that  e(h;m  ,m  )  -  (2 r) 

O  kl 


‘"..a  Ur)'";  |i.-h|2J(«s+V, 

XV  XV 


Hera  we  apply  the  results  of  Huber  and  Strassen  to  find  the  structure  of 


minimax  linear  smoothers  for  general  models  of  this  type. 

2.  The  minimax  smoother  for  capacity  classes.  In  the  following,  2  denotes 
a  fixed  subset  of  Ra,  Cl  denotes  the  Borel  a-algebra  on  2,  and  denotes 
the  class  of  all  finite  measures  on  (f.,27).  Recall  that  a  finite  set 
function  v  on  Cl  is  a  2-alternating  capacity  (see  Choquet  [5])  on  (Q ,&) 
if  it  is  increasing,  continuous  from  below,  continuous  from  above  for  closed 
sets,  and  if  it  satisfies  v(0)  «  0  and  v(A  U  B)  +  (A  0  B)  s  v(A)  +  v(B) 
for  all  A,  B  ‘z  Cl.  For  a  2-altematir.g  capacity  v  on  define  the  set 

\  by 

\  -  {m  €  m  \  m(A)  £  v(A)  for  all  Aid,  and  m(fi)  -v(p)}.  (2) 

A  number  of  properties  of  classes  of  the  form  of  (2)  have  been  developed  by 
Huber  and  Strassen  [4].  Note,  for  example,  that  7! is  weakly  compact  and 
that ,  if  v  is  a  measure ,  then  »  { v} . 

For  any  pair  (vg>vj_)  of  2-alternating  capacities  on  (C.,£T)  there  exists 
a  Radon-Nikodym  derivative  dv^/dv^,  introduced  in  [4],  which  has  the  defining 
property  that,  for  each  t  €  [0,»], 

^({dv^/dvQ  >  t})  -  inf  rt(A)  (3) 

A  €  d 

A  _1  c 

where  v^(A)  *  (1  +  t)  [tVg(A)  +  v^  (A  )].  This  derivative  (which  is  a 

family  of  functions  having  the  defining  property  (3))  is  the  basis  for  the 
rninimax  tests  between  capacity  classes  of  the  form  of  (2)  as  considered 
in  [4] .  Further  properties  and  a  generalization  of  this  derivative  have  been 
considered  by  Rieder  [6].  In  this  context  we  state  the  following  result 
which  is  Theorem  4.1  of  [4]: 
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Lemma  2.1  (Huber-Strassen) :  Suppose  v  and  v  are  2 -alternating  capacities 

and  -ta  is  a  version  dv  /dv„.  Then  there  exist  measures  q_  S  7?  and 
0  S  N  'Vg 

€  71  such  that  -q  €  dq^/dq^  and  such  that 

V^O  <  t})  =  vs({tt0  <  t}) 

and 

%^n0  >  c*>  "  Vfno  *  t}) 


for  all  t  6  [0,»] . 

Let  JC  denote  the  class  of  all  complex-valued  £?-measurable  functions  on 
Lemma  2.1  leads  to  the  following  theorem: 

Theorem  2.2 :  Suppose  vg  and  v^  are  2-altemating  capacities  on  ( C.,(7) .  Let 
~0  be  a  version  of  dvs/dv^  and  choose  (<ls»qN)  as  in  Lenma  2.1.  Define 
h0  *  V1+V  1*  Then  fh0’^qS'qN^  is  3  3addle-Point  solution  to  the  game 

min  sup  e(h;m  ,m  ) 

h5ic  <WeV\. 

S  N 

where  e  is  defined  in  (1),  and  thus  tig  is  a  minimax  linear  smoother  for 

7?  and  7 1  . 

v  v 

S  N 

Proof:  Noting  that  hQ  €  dq^/d (qg  + q^) ,  we  have  directly  that 

e(VqS*V  2  e(h;vV 

for  all  h  :  K,  Thus,  it  is  sufficient  to  show 


e(hOiaiS'V  5  e(VW 


(4) 


6 


for  ,11  (.5,^)  €  3!s  X  51,.  t— .  2-1  — 'h“  "o  15  *=<=>=‘>«,3C103lly 
sm.ll.st  ov.r~„  udd.r  ,s  and  la  .tochm.tlc.lly  o«.r  »dar 

and  |h0j  2  * 


q  Thus,  sinceS|l-h0i2  -  (1+V'2  iS  decraasin8  *  ^0 


?liV 


and 


"2  is  increasing  in  ttq,  we  have 


;  u-h0i2d“s5 


lh0l% 


;’Jh0i"dqN 


for  ail  (ms>^)  6  \  X  Equation  (4)  and  hence  Theorem  2.1  follow. 

Kote  that,  in  view  o ^Theorem  2.1,  the  pair  (qs,qR)  singled  out  by 
Lemma  2.1  can  be  thought  o£  least-favorable  pair  of  spectral  measures 
for  minimax  linear  smooching.  Concerning  this  pair  of  measures,  we  may 
also  state  the  following  property: 

.  /  -  \  £  ***  y  *7?  qaric£ies  the  conclusion  of 

Theorem  2.3:  The  parr  (qs,qs)  -  "V  x  satrsries 


Lemma  2.1  if  and  only  if  its  maximizes 


min  e  (h;ms>m^)  -  T  JdV*  (mS  +  V 1  % 


h'J'- 


over  all  (m  ,oO  *  *71  x  • 

3  •  S  N 

Proof:  Define  f  -  drn^/d (m3  h m^) .  Then 


in  e(h;m3,m^)  -  (2rr) '^fdnu  «  (2-)'^^  ‘  r',d'a3  ^ 


tain 
h  C  /V. 


Since  C[x]  -  (x-x2)  is  concave  and  twice  continuously  s itfire 
[0,1],  Theorem  2.3  follows  from  Theorem  b.l  of  ■>  ■ 
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3.  Discussion.  Theorem  2.2  gives  Che  general  solution  to  the  minima* 
linear  smoothing  problem  £or  signal  and  noise  uncertainty  classes  of  the 
form  of  (2).  Several  useful  examples  of  classes  of  this  type  are  given  by 
Huber  and  Strassen  in  [4] ,  and  other  useful  examples  are  given  by  Rieder 
[6],  Strassen  [7],  and  Vastola  and  Poor  [8].  Same  of  the  moat  commonly 
used  examples  of  classes  of  the  form  77^  can  be  written  as  «  neighborhoods 
of  same  nominal  measura  u.  Examples  of  capacity  classes  that  have  this 
structure  are  contaminated  mixtures,  variational  neighborhoods,  and  Prohorov 
neighborhoods  (see  [4]).  For  this  type  of  class,  an  uncertainty  model  vill 
consist  of  a  nominal  pair  of  signal  and  noise  spectral  measures 

with  respective  degrees  «s  and  of  uncertainty  placed  on  the  nominal 
measures.  The  derivative  between  capacities  generating  classes  of  this 
type  is  often  of  the  form  (see  Huber  [9,  10]  and  Rledar  [6]) 

^Q(i»)  •maxi'c',  min{c",  X(u>)}},  is  €  .1,  (3) 

where  \  is  the  Radon-Nlkodym  derivative  between  the  nominal  pair  of  measures 

(i.e.,  \  €  <b-,/<fcO  and  c'  and  c"  are  nonnegative  constants  with  c'  £  c". 

5  rv 

If  -  of  (3)  is  a  version  of  dv./dvu,  then  Theorem  2.2  implies  that  a 
u  s  rv 

minima*  linear  smoother  for  V(  and  771  is  given  by 

S  VNf 

hg  (cu )  •  max[  k ' ,  miafk",  h'(ui)}},  u»  €  H  (6) 

where  k'  •  c'/(l  +  c'),  k"  »  c'VCl  +  c")  and  h'  ■  \/(l+\).  Note  that  h'  is 
the  optimum  smoother  for  the  nominal  model,  and  thus  the  minima*  linear 
smoother  for  this  case  desensitises  the  noaulnal  smoother  (to  a  degree  depend¬ 
ing  on  «s  and  c  )  in  those  spectral  regions  where  either  or  1* 
dominant  (i.e.,  where  h'  la  near  1  or  is  near  0). 


In  the  situation*  for  which  (S)  is  valid,  (6)  givas  tha  transfer 
function  of  tha  oiaiaax  linear  smoother.  Suppose,  for  example,  that  n-i, 

-  [-b,b]  for  sens  b  <  »,  c’  <  c",  and  h*  is  synsatric  about  ,u«0  and 
is  strictly  dacraasing  on  [0,b].  Than  tha  mint mar  linear  estimate  of 
determined  by  hQ  is  given  explicitly  by 

S,  *  I  V*  ‘C)Ytdt 

where  hQ  ^  is  given  by 

hQ(t)  «  h’  (t)  *  k’  [sin(bt)  -  sin(a't) ] / (rrt)  +  k''sin(a"t)/ (rrt) 

•  . 

-  ’  h'  (C  -  r)  (jin(b»)  -  #ia(a'->  +  u In (*"-)]  (rrr)~  dr 

u 

with  h'  •  5*l{h'}  and  with  a'  [rasp.,  *"]  the  poaitiva  solution  to  h'(a')  »k' 
[rasp. ,  h' (a")  •  k"] . 

As  a  final  comment  we  note  chat,  although  we  assuaMd  initially  that 
the  observation  field  was  a  continuous-parameter  field,  Theorem  2.2  and 
2.3  are  also  directly  applicable  to  the  case  in  which  the  observation  field 
is  a  discrete-parameter  field  (1.*.,  in  which  the  tlm  set  is  Zn)  since 
this  latter  situations  corresponds  to  tha  particular  ease  of  the  analysis 
of  Section  2  in  which  H  ■  (*W»TT]B» 
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Minimax  State  Estimation  for  Linear  Stochastic 
Systems  with  Noise  Uncertainty 

VINCENT  POOR,  member,  ieee,  and  DOUGLAS  P.  LOOZE. 

MEMBER,  IEEE 

Abstract — The  problem  o I  ntinurex  linear  state  estimation  for  linear 
stodMMk  systems  driven  and  observed  in  noises  whose  second  order 
properties  ate  unknown  la  conaidawd.  Two  general  ejects  of  dda  problem 
are  named  the  sugle-varlabie  problem  with  uncertain  noise  spectre  and 
the  nmhltariabla  problem  with  uncertain  loaasomatalu  noise  correlation. 
General  miaimea  results  ere  preieided  for  cedi  of  them  situations  involv¬ 
ing  characterisations  of  the  minimal  fitters  in  terms  of  least  favorable 
second  order  properties.  Explicit  solutions  are  given  for  the  iprt'lral  band 
uncertainty  model  in  the  single-variable  cases  treated  and  for  a  matrix-norm 
neighborhood  modal  in  the  imdtii  ariabla  case.  Characterization  of  saddr- 
points  In  terms  of  the  estwmal  properties  of  the  mdse  uncertainty  dasaas 
is  also  dhenmed 

l  Introduction 

Several  recent  studies  have  considered  problems  of  minimax  linear 
filtering  and  smoothing  of  stationary  processes  with  uncertain  second- 
order  statistics.  Examples  arc  the  works  of  Kassam  and  Lim  (I),  Kassam. 
Lim.  and  Cimim  (2],  and  Poor  (3],  (4).  In  this  paper,  we  consider  the 
related  problem  of  minimax  slate  estimation  for  linear  stochastic  systems 
in  which  the  process  noise  and/or  observation  noise  processes  have 
uncertain  second-order  properties.  In  particular,  we  consider  the  usual 
linear  system  model 

x, =Ax,  +  B(,-,  t>t0  (1) 

y, =Cx,+d,;  t>f0  (2) 

and  where,  for  each  t»r0,  x,eR*,  y,ER',  f,€R“,  and  A.  B, 

and  C  ate  constant  matrices  of  the  required  dimensions.  We  assume 
throughout  that  ({,.  r€R)  and  (#,;  <ER }  are  orthogonal,  zero-mean, 
wide- sense-stationary  random  processes. 

In  this  paper  we  consider  two  general  aspects  of  the  problem  of  linear 
stale  esumauon  in  <  1 1  and  (2)  for  situations  in  which  the  second-order 
statistics  of  the  noise  processes  are  specified  only  to  be  within  some 
nonparametric  classes.  Specifically,  in  Section  II  we  consider  steady-state 
filtering  for  the  single- variable  case  of  ( 1)  and  (2).  We  assume  that  one  of 
the  noise  processes  {(,;  r€R)  and  {#,;  r€R)  is  a  white  noise  and  that  the 
other  has  an  unknown  spectral  density.  A  minimax  mean-square  error 
design  criterion  is  adopted  and  the  existence  of  a  solution  for  this 
formuUuon  is  demonstrated  for  several  useful  noise  spectral  classes.  In 
Section  III  we  consider  the  multivariable  cam  of  (I)  and  (2)  in  which  ({,; 
>€R)  and  (I,;  t€R)  are  both  multidimensional  white  noises  with  uncer¬ 
tain  componentwise  correlation  matrices.  A  general  minimax  theorem 
which  extendi  a  result  of  Morris  (5|  is  presented  for  this  cate  and  results 
are  given  which  characterize  a  least-favorable  correlation  structure  speci¬ 
fying  the  mini  max  filter. 

II.  Onr-Dimuwonal  Filtirino  with  Unknown  Noise 
SfCCTRA 

A.  General  Formulation 

In  this  section  we  consider  the  particular  case  of  (I)  and  (2)  in  which 
m^n~r*  |  and  <4<0.  Without  lota  of  generality  we  take  C=  I.  For  a 
particular  t>ta,  we  consider  the  estimation  of  the  state  x,  based  on  the 
observation  of  (>,;  r0<v<»)  and  consider  the  steady-state  case  resulting 
from  the  umt  We  assume  that  the  processes  {{,.  ieR}  and 

(#,;  r€R)  have  spectral  denaocs  o,  and  a#.  respectively  Note  that  the 
mean-square  filtering  error  aaaociaifd  with  a  time-invariant  linear  filter 
whom  transfer  function  is  H  is  given  by 

M—rnsi  reeatveS  April  J.  I  SCO:  mn 4  Mart  3.  INI  Piper  memmmteO  by  A 
EparrewPee.  Pm  CRamae  et  an  Imam  Coaawa  Tier  mwl  via  irpaonea  by 
an  Sam  laaia  Elwaaam  riapaa  maw  Cemnei  N000l«-T*c-0«34  m  by  ik 
Orilre  of  Nam  X  mans  mdtr  Canaan  N000U-I  IK-0014 

TW  anban  an  *Mb  aw  Dapartamt  a<  EtaaawM  Eapaawtag  aaa  Un  "irfmuif 
ham  Liharamy.  Uawanny  of  IMaaa  a  Urbaaa  Clwaaaaa.  Uibam.  IL  allot 


E{|x,-2,|J}=(2*)"/*  |l-/f(w)|2«J(wJ+/tJ)',a|(«)dw 

'  -  ac 

-t-(2»)~7“  |W(«)|2o»(w)dw=  (3) 

Let  5Cf  denote  the  class  of  complex-vsiued  transfer  functions  of  causal 
time-invariant  linear  filters.  Then  for  fixed  o(  and  of,  the  optimum  linear 
(minimum  mean-square  error)  state  estimation  filter  is  found  by  solving 
the  problem 

min  S(ff; oi,«a).  (4) 

ffeOC* 

If  o,  and  a,  are  such  that  the  observation  spectrum  o>(u)=(S2(u2  + 
d2)'2»|(ia)  +  o,(w))  satisfies  the  Paley- Wiener  condition.1  then  the  solu¬ 
tion  to  (4)  is  given  by  (Wong  [60 


J2(m2-M2)  'at(u) 


where  superscripts  denote  multiplicative  spectral  decomposition  and  sub¬ 
scripts  denote  additive  spectral  decomposition.  If,  on  the  other  hand,  the 
spectra  <J(  and  are  not  known  exactly,  but  are  known  to  be  in  classes  9t 
and  01,  respectively,  of  spectral  densities,  then  an  alternate  design  crite¬ 
rion  to  (4)  is  the  minimax  mean-square  error  criterion 

min  j  max  S(  ff;  «»,«•)  } .  (6) 


Several  studies  have  considered  problems  related  to  (6).  The  analogous 
noocausal  (smoothing)  case  has  been  considered  for  spectral  band  models 
in  [1],  for  more  general  spectral  density  models  in  [3],  and  for  general 
classes  of  spectral  measures  in  [4],  Some  aspects  of  a  related  causal  case 
have  been  considered  in  [7]  and  more  recently  in  [3],  Note  that  a  saddle- 
point  solution  to  (6)  is  a  point  ( H‘:  <^)€0Cf  x’Xx'X  satisfying 

max  min  S(/f;  ai,  ai  ). 

‘  '  Hex'  ‘ 


(7) 


That  is,  a  point  satisfying  (7)  consists  of  a  least- favorable  spectral  pair 
(«|,  o^JE'Xx'X  and  their  corresponding  optimal  ( rru  nun  urn- mean- 
square -error)  filter  H\  which  is  the  mimmax-mcan-square-crror  filter  for 
0Cx0l.  In  the  following  subsections  we  consider  the  existence  of  such 
solutions  for  situations  in  which  one  of  {{,;  (ER )  or  (9,;  i6R)  repre¬ 
sents  white  noise  and  the  other  has  an  uncertain  spectral  density. 

B.  Wtul*  Process  Noise  with  Uncertain  Observation  Noae 

Suppose,  for  now,  that  the  process  noire  (f,;  ieR)  represents  white 
noire  with  known  spectral  height  Z  and  that  {#,;  ieR)  has  a  spectral 
density  et  which  is  known  only  to  be  in  a  class  “Xc  L,(R)  of  noire  spectra 
where  L|(R)  denotes  the  class  of  absolutely  lntegrabk  real-valued  func¬ 
tions  on  R.  This  problem  now  fits  into  the  framework  developed  in  (3).  In 
particular,  we  hare  the  following  result  which  is  analogous  to  Theorem  2 
of  [3[. 

Theorem  I:  Suppore  X  is  a  convex  class  of  spectral  densities  each 
satisfying  the  Paky- Wiener  condition  and  («,'}  where  «j(w)=S  for 

all  w€R.  Than  ( H‘\  o(.  a,)  with  ff'  optimal  for  (p|',4)  •*  *  saddkpoint 
solution  to  (6)  if.  and  only  if.  solves 

n»s  /(*)  (8) 

•»«*< 

where  the  functional  J  is  defined  by 


whew  e,  is  the  state  spectral  density  given  by 

'Recall  ttiei  thr  NSv-Wmh  .-ooeiuoa  for  •  it  thr  ftaureeat  of  the  tewarai 

l?-ll«ao«•llH•l,  *  ir‘e» 
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«,,(m)  =  B2S(w2-M2)  (10) 

Proof:  It  follows  from  the  results  of  Yao  (8)  that,  for  any  ot  eiX,  we 

have 

min  S(H;a„ej)  =  (^/2)(l-«P{-(*/^V(«#)})  (") 

H€X* 

where  K=B2a|4|.  Since  the  quantity  in  (11)  is  monotonically  increasing 
in  J(at)  the  “only  if  part  of  Theorem  1  follows  immediately.  The  "if 
part  follows  as  a  straighlfoward  modification  to  the  proof  of  Theorem  2 
of  (3) 

Theorem  I  implies  that  a  minimax  state  estimator  for  this  case  can  be 
sought  by  considering  the  maximization  problem  of  (8).  With  respect  to 
this  problem  we  have  the  following  result  from  [4,  Proposition  2): 

Theorem  2:  The  functional  J( o )  is  upper-semiconlinuous  on  L  ,(R )  and 
thus  achieves  its  maximum  on  any  compact  subset  °X. 

Thus,  if  01  is  compact,  the  existence  of  a  solution  to  (6)  is  assured. 
Compactness  is.  of  course,  only  sufficient  (and  not  necessary)  and  may  be 
a  somewhat  restrictive  condition  here  since  £.,(R>  has  noncompact  subsets 
which  are  of  interest  as  spectral  uncertainty  models. 

Example  (Spectral  Band  Model):  A  spectral  uncertainty  model  for 
which  the  robust  noncausal  filtering  problem  has  been  solved  is  the 
spectral  band  model.  This  is  the  model  for  spectral  uncertainty  used  by 
Kassam  and  Lun  ( 1),  and  for  our  case  is  given  by 


w€R.  (2ff  )  J  w )  dii)  —  /*fj 

(12) 


where  P,<x  is  a  fixed  noise  power  and  aL  and  av  are  fixed  power 
spectra  satisfying  the  Paley-  Wiener  condition.  The  solution  to  the  prob¬ 
lem  of  (8)  can  be  found  for  this  case  by  applying  the  results  of  (3| 
concerning  the  relationship  of  the  minimax  filtering  problem  to  an 
analogous  problem  in  hypothesis  testing.  It  follows  from  the  results  of 
Section  III  of  [3]  and  from  the  concavity  of  J  that  the  spectrum  o# 
maximizing  J  over  X  of  ( 12)  is  given  by 

<sf=2srP,q  (13) 


where  q'  is  a  probability  density  in  the  class  ■#„  =  (p|  p=ot/(2wP,y. 
o,60)  that  is  least  favorable  (in  the  sense  of  Huber  (9))  (or  testing  % 
versus  the  probability  density  p ,  =«,//««,  A  recent  result  of  Kassam 
[  10)  gives  q  for  this  case,  and  thus  it  follows  straightforwardly  from  [10] 
and  (13)  that  of  is  given  by 


ot(u); 


4i(w)  = 


ae,(w); 

•t(W: 


if aa  (u)<o,(u) 

if  a,  (u)<aow(  w)«lot(w) 

i  fao,(w)>ol,(w) 


(l«) 


where  a  is  a  constant  chosen  to  satisfy  /*o»  =  2wP,  Note  that  all 
members  of  X  of  ( 12)  must  satisfy  the  Paky-Wiener  condition,  and  thus 
Theorem  I  implies  that  the  mini  max  filter  ff  for  this  case  is  the  minimum 
mean-square  error  filter  for  the  pair  ( a,. op  given  by  (3)  where  op u)=X 
for  all  «€R 

Note  that,  in  general,  4  of  (14)  will  not  yield  a  rational  observation 
spectrum  a,,  so  that  the  mimmax  transfer  function  H’  must  be  found 
numerically  However,  since  H‘  represents  the  causal  projection  of  the 
noncausal  estimate  of  x,  (i.c..  the  estimate  based  on  { yT;  re*}),  some 
insight  shout  the  structure  of  AT  is  derived  by  considering  the  correspond¬ 
ing  noncausal  estimate.  The  transfer  function  of  the  noncausal  estimate 
for  the  spectral  pair  (ef.op  is  H,t  =«,/(«,  -ray)  where  a,  is  from  (10). 
Nose  that  this  transfer  function  represents  a  mini  max  smoother  for  this 
case  (see  (ID.  Using  (14)  we  note  that  H„  is  given  by 


H.A  w )  *  mm{  Hl(  w  ),  max{  * .  Hv(  w ) )  } 

where  *=(l+oj~'.  HL  *e,/(e,  *ol).  and  HL  *e,/(e, -*-et ).  Thus. 
H,  »H„.  and  H„  is  less  sensitive  than  either  HL  or  Hb  in  regions 
where  HL  is  large  or  Ht  is  small  Note  that,  if  Hviu)<k  f or  ill  w€R. 
then  H„  is  effectively  a  version  of  Ht  with  the  gain  limned  to  *  in  regions 
where  Hp*)>k 

The  above  example  illvetraMs  the  structure  of  the  minimax  niter  for  the 
ipcctral-band  uncertainty  model  which  haa  been  applied  frequently  aa  a 


model  for  such  uncertainty.  Other  models  can  be  treated  similarly.  For 
example,  the  class  consisting  of  all  spectral  densities  with  a  given  power 
which  differ  in  L,  norm  from  a  nominal  spectrum  can  be  treated  using  the 
results  of  [3|.  Another  widely  used  model  is  the  mixture  model  which  is  a 
modification  of  (12)  corresponding  roughly  to  the  case  -*  oo  and 
oL  =(1  -«)o0  where  o0  is  a  nominal  spectral  model  with  jo0=2*P,  and  < 
a  degree  ot  spectral  uncertainty  chosen  by  the  designer.  The  solution  to 
maXig.it /( a)  for  this  latter  case  is  given  by  ( 14)  with  the  third  alternative 
never  occurring. 

Before  considering  the  case  of  uhknown  process  noise  statistics,  we  note 
one  further  point.  In  particular,  suppose  the  process  (8,;  r£R)  is  mean- 
square  continuous  but  does  not  have  a  spectral  density.  It  follows  from 
Bodmer's  theorem  (Wong  (6J)  that  (8,;  (SR)  has  a  spectral  measure  m, 
and  from  the  Lebesgue  decomposition  theorem  (see  Royden  [II])  that  we 
can  always  write  mt  =(,  +j,  where  t,  is  absolutely  continuous  with 
respect  to  Lebesgue  measure  and  r«  is  singular  with  respect  to  Lebesgue 
measure.  Moreover,  Snyders  [12]  has  shown  that  the  minimum  filtering 
error  minw«x<£{|x, -i,|2)  depends  only  on  I,.  Thus,  for  any  class  of 
spectral  measures,  we  might  restrict  our  attention  to  the  class  consisting  of 
their  absolutely  continuous  parts  and  Theorem  1  may  still  apply. 

C.  White  Obteroatum  Non*  with  Uncertain  Process  Noise 

We  now  consider  the  alternate  case  in  which  the  observation  noise  (8,; 
r€R)  represents  white  noise  with  spectral  height  8>0  and  the  process 
noise  {{,;  t£ R)  has  spectral  density  o(  which  is  known  only  to  be  within 
a  class  %.  We  assume  that  %  is  such  that  the  class  £={f(w)=e(w)(w2 + 
A2)-1,  o60C)  is  a  subset  of  L,(R).  Within  this  context  we  have  the 
following  analog  to  Theorem  I. 

Theorem  3:  Suppose  9C  is  convex  and  is  such  that  the  members  of  £ 
satisfy  the  Paky-Wiener  condition.  Define  (oy)  where  oy(w)  =  0  for 
all  w€R.  Then  ( H'\  of,  op  with  H'  optimal  for  ( o y,  op  is  a  saddkpoint 
solution  to  (6)  if,  and  only  if,  ot'  solves 

max/(«,)  (15) 

•|6T 

where  the  functional  /(e)  is  defined  by 

/(e)  =  (2*)''©/“  log[l+o(w)*2/(e(«2-M2))]i/iu  (16) 

Proof:  We  note  first  that,  for  any  e(  £0C.  we  have  (see  Yao  [8]) 

mm  £(H;  ay, »,)  =  /( e()  (17) 

Thus,  the  “only  if"  part  follows  immediately.  To  show  the  “if  part, 
suppose  S(£0C  solves  (1 3).  Note  that  the  inequality  log(l  +x)«x  and  the 
assumption  £cL|(H)  imply  that  l(op< oo.  For  e(€0C.  it  follows 
straightforwardly  from  the  concavity  of  log(l+x)  on  (0,»)  and  the 
monotone  convergence  theorem  (Royden  (IID  that 

d/((l-r)e,+re,)/8«|t»0=(2*)*'M2 

/  ■M2)  +  B2e,’(w)]  '[•|(m)-o<( «)]</«  (18) 

Suppose  H'  is  the  minimum  mean-square  error  filter  from  (3)  for  the  pair 
(e^,  op.  We  have  (Yao  |8D 

|l -  //'(w)|2  =9[l  +82»|'(w)/(8(iii2 -M2))]  «ea  (19) 

Thus,  (3),  (18),  and  (19)  imply 

8/((l  -«)e,  +e«^)/9e|,.0=S(  H:ot,ef)-$(H':  op  of)  (20) 

Since  /(e)  is  concave  and  e(‘  maximizes  /(e)  over  0C ,  we  must  have 
8/((l  ~()Of  -*-«»,  )/d«|,  »0  <0  for  all  e,  Thus. (20)  gives  the  left-hand 
equation  of  (7).  and  ( H' .  op  of )  is  a  saddle-point  solution  to  (6).  This 
completes  the  proof.  □ 

Theorem  3  implies  that  the  solution  to  (6)  may  be  sought  for  this  caw 
by  seeking  a  solution  to  ma*  t,/(e,)  We  note  that.  /  is  concave  and 
can  be  shewn  to  be  uppcr-sesmcontimimis.  Thus,  the  solution  to  (6)  is 
assured  if  0t  is  compact.  Again,  if  {(,;  i€R  )  can  have  a  spectral  measure 
which  is  not  absolutely  continuous  with  respect  to  Lebesgue  measure,  the 
white-obtervatioo- noise  caw  might  still  be  treated  via  Theorem  3  by 


i 


•Mb*** 
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resuictiat  attention  to  the  absolutely  continuous  parts  of  the  relevant 
spectra. 

As  an  example,  it  is  again  of  interest  to  consider  the  spectral  band 
model  discussed  above.  However,  it  is  more  reasonable  to  assume  that  the 
power  in  the  state  process  (x,;  /SR)  is  known  rather  than  the  power  in 
the  process  noise  {(,;  teR).  Thus,  we  consider  the  following  modified 
version  of  (12): 

X  =  | a, i«tl( w )<o,( ui )<a,  ( w );  vEQ. 


S(H;I.e)&£{«rC«,}  (25) 

where  £  is  a  positive  semidefinite  matrix. 

The  Laplace  transform  of  (I),  the  assumption  H£'X  *  and  the  assump¬ 
tion  that  {{,;  t€ R)  and  {*,;  rSR)  are  orthogonal  allows  the  weighted 
mean-square  error  (25)  to  be  written  as 

S(/f;i.8)  =  (2iry)''tr{c//“{/-ff(r)C)(r/-^)'' 


(2»)''«:/%,(w)(w;-v.4J)"'i/u  =  />IJ  (21) 


where  aL{  u)(u:  -  4:)  ' 1  and  ((luKu’+zt3]"'  satisfy  the  Paley- Wiener 
condition.  Then  it  is  straightforward  to  show  that  the  solution  to  max  /( a ) 
is  given  for  this  case  by 


«l(«)  = 


f«t(«): 

i/9(«J->-.42); 


( « ) ; 


if  0(w2  +/42)<«1(<1|) 
if  *,.<«)<*(  w2**2)^*)  (22) 
if  0{u2  *-A1)>alJ(u) 


where  0  is  chosen  so  that  <2ir )"  '£2/?„«,<«)(w2  A-A2)~ldu  =  P,.  The 
mini  max  filter  is  then  given  by  (5)  for  the  pair  ( oj.  a, ). 

Another  situation  of  interest  here  is  the  case  in  which  the  process  noise 
is  white  with  unknown  spectral  height.  That  is.  we  have 


X  =  {  <j,o,  ( w )  =  £  forall  u€Q  and  some  £e(a.  A] }  (23) 


BZBr(  -j l-A)~T\  l-H( -s)C]Tds 


+  QjJ°°  H(s)BH(-s)Tdsj  (26) 

where  the  superscript  T  denotes  the  transpose  of  the  indicated  matrix.  / 
denotes  the  it  X  n  identity  matrix,  and  tr  denotes  the  trace  of  the  bracketed 
matrix. 

When  £  and  8  are  fixed  the  filter  which  minimizes  (26)  over  %  *  is 
given  by  the  steady  state  Kalman- Bucy  niter  [16)  (independently  of  the 
weighting  matrix  Q): 


i,-=Ax,  +  fC[y,-Cx, ) 

(27) 

K=PCT&~' 

(28) 

0=AP+  PAr  +  Z-  PCT8  -  'CP 

(29) 

The  transfer  function  of  the  filter  defined  by  (27)-(29)  is  given  by 


where  0 «Sa«A<x  It  follows  immediately  from  (16)  that,  for  this  case. 
o,<  u  l  =  A.  a  solution  which  is  more  or  less  obvious  even  without  Theorem 
3  Note  that  a  similar  result  would  be  obtained  if  we  allowed  ■il  of  Section 
1I-B  to  be  of  the  form  of  (23).  In  fact,  we  note  that  if  either  -X  of  Section 
II-B  or  X  contains  an  maximal  element  oc  (i  e..  an  element  ot  EX  (or  X ) 
such  that  et<  uitsnwi  for  all  «i€Q  and  for  all  seX  (or  X )).  then  the 
corresponding  least-favorable  spectrum  4  or  »,  will  be  the  maximal 
element  This  problem  is  related  to  the  notion  of  a  bounding  filler  which 
is  an  alternate  approach  to  ireaung  problems  of  unknown  noise  statistics 
(see  Nahi  and  Weiss  ( 1 3].  (14)  and  Greenlee  and  Lcondcs  [13]).  Unfor¬ 
tunately.  maximal*  do  not  exist  in  many  models  for  spectral  uncertainty 
i  for  example,  the  spectral  band  models  (12)  and  (21)  do  not  generally 
contain  maximals)  However,  for  white-noise  models  the  role  of  extremal 
points  is  more  important  as  will  be  demonstrated  below  in  the  treatment 
of  the  multivariable  filtering  problem  with  white  noise  of  uncertain 
componentwise  correlation  in  both  the  process  and  observation. 

Ill  Minimax  State  Estimation  roe  Multivaiuasu  Systems 


H(s)  =  [r/-H-tfC)]‘'AT  (30) 

The  corresponding  minimum  weighted  mean-square  error  is 

S(ff,E.8)  =  tr{Cf»(2,8)}  (31) 

where  the  dependence  of  the  error  covariance  matrix  on  £  and  8  is 
denoted  explicitly 

Often  the  second-order  statistics  of  {(,;  r€R)  and  (#,;  r£R)  are  not 
known  exactly.  A  common  representation  of  this  type  of  uncertainty  is 
that  2  and  8  are  cot. lamed  in  sets  XCS"'”  and  XcS"".  respccuvcly 
The  problem  to  be  conside>-:  t  is  to  find  the  best  filter  in  X  *  in  the  sense 
that  this  filter  minimizes  the  largest  weighted  mean-square  error  produced 
when  2  and  8  range  over  all  possible  values.  That  is.  we  wish  to  solve  the 
mini  max  problem 

min  maxS(ff;2.6).  (32) 

Max'  left 
•e* 


A  Formulation 

The  purpose  of  this  section  is  to  consider  the  state  estimation  problem 
for  the  multivariable  form  of  the  linear  stochastic  system  (1)  when  the 
second-order  statistics  of  the  process  and  observation  noises  are  not 
known.  Let  the  dimensions  of  x,,  and  v,  ( it.  m.  and  r.  respectively)  be 
arbitrary  and  assume  that  (f,;  f€R)  and  {*,;  r€R)  are  orthogonal, 
zero-mean,  wide- sense-stationary  white  noise  processes  with  covariances 
£(t,(f)=24(r-T)  and  £(9,*frj=8*(i-r)  where  2  and  8  are  symmet¬ 
ric  positive  definite  matrices.  Also,  it  will  be  assumed  throughout  this 
section  that  ( A.  C)  is  an  observable  pair  and  <  A,  B)  is  a  controllable  pair. 
This  section  will  derive  the  minimax  slate  estimator  when  the  matrices  2 
and  8  are  known  only  to  be  contained  in  subsets  of  the  convex  cones  of 
positive  definite  matrices  Sm*mC Rm*m  and  S'"' CR'"’,  respectively 

We  will  restrict  ourselves  to  consideration  of  causal,  linear  time-invariant 
filters  which  produce  a  wide-sense-stationary  error  process 

xt  —  S,  (24) 

where  (x,;  teR )  is  the  output  of  the  filter  Define  X  *  to  be  the  space  ot 
complex  valued  n*r  matrices  of  Laplace  transforms  of  all  such  filters. 
For  every  H S‘X  *  and  for  every  2  and  8  the  weighted  mean-square 
estimation  error  is  defined  as 


B.  Exit  met  and  Ckaractmtauon  of  a  Saddkyouu 

Two  important  results  concerning  solutions  to  the  mini  max  problem 
formulated  in  Section  III-A  arc  presented  in  this  section  The  first  result 
establishes  an  equivalence  between  a  saddkpoint  solution  to  (32)  and  the 
Kalman-  Buey  filter  corresponding  to  a  particular  ( 2. 8)  pair  The  second 
result  establishes  the  existence  of  a  saddkpoint  when  the  sets  X  and  X 
are  convex  and  compact 

To  obtain  these  results,  we  will  need  the  following  weU-known  theorem 
(cf.  [I?])  which  establishes  the  fact  that  the  existence  of  a  saddkpoint  is  a 
necessary  and  sufficient  condition  for  the  mini  max  problem  (32)  to  be 
equivalent  to  the  corresponding  maxiimn  problem 

max  mina£(/f;X,8).  (33) 


7Vw*m  4:  There  exists  a  triplet  (ffo.Zg.^lEX  *  x»Xx"X  satisfy- 
ioi  the  iuldkpnini  coodiuos 

S(  H„.  2.8)<S(W„;  Xb.Ro)^  H;  2„.80) 


\ 

f 

I 


* 


* 


j 


¥#f€X*.  26“X,  8€*  (34) 


n-tUtt  A,  aaujupMuadasfctf  <4  * 
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if  and  only  if  the  values  of  (32)  and  (33)  are  equal.  Moreover,  a  triplet 
satisfying  (34)  is  a  solution  to  (32)  and  (33). 

Theorem  3  provides  the  desired  charactertzauon  of  a  saddlepomt. 
Theorem  5:  Assume  that  there  exists  Z,  E"X  and  9„  6  ■it  which  satisfy 

tr  {  BtZ0BZ  }  <  tr  { BrZ0  FZg  }  Viet X  (35) 

tr{Ar0rz0Ar0e}<tr{Ar0rz0Ar0e0}  v&ex  (36) 

where  K0  is  the  Kalman- Buev  gain  corresponding  to  Z,  and  9„  defined 
by  (28)-  (29)  and  Zn  is  the  solution  to  the  equation 

( 4-Aroc)rzo+Zo(/4-AroC)j-g=0  (37) 

Let  H^s)  be  the  transfer  funcuon  of  the  Kalman- Buev  filter  (30) 
corresponding  to  Zo  and  90.  Then  ( ff0.  Zg.  90)  is  a  solution  to  (32)  and 
(33) 

Conversely,  suppose  that  ( f/0,Zg,90)  is  a  saddlepomt  for  (32).  Then 
H0  is  (he  transfer  function  of  the  Kalman-  Bucy  filter  ( 30),  and  Zg  and  90 
satisfy  (35)-(37). 

Proof: 

Sufficiency:  Consider  the  maximin  problem  (33).  For  fixed  Z  and 
9.  the  solution  of  the  minimizauon  is  given  by  the  Kalman- Bucy  filter 
(27)-(29)  The  filter  transfer  function  H(s)  is  given  by  (30)  and  the 
mean-square  error  is  given  by  [see  (26)] 

J?(/7;Z,9)=(2wj)~'tr  -  W(r)C](s/— -4)*1 

BZBT(-sl-A)~T[l-H(-s)C]Tds 

-(?/'”  W(r)9/f(-^)rdi}  (38) 

Since  (38)  is  addiuvely  separable  in  Z  and  9,  the  joint  maximization 
required  by  (33)  can  be  carried  out  separately 
First,  consider  the  process  noise  Assume  Zg  satisfies  (35)  and  (37)  for 
fixed  9  The  soluuon  to  (37)  is  given  by 

Z0*/%M-*.C>r'0fM-*.c>.<*  (39) 

'0 

Substituting  1 39)  in  (35)  and  using  a  simple  trace  identity  fives 

tr  {  a rZ0  BZ }  =  tr  j  Q  f*e'  4  ~  K*  "BZB  re,4-K°c‘r'dij  (40) 

By  Parscval’s  theorem,  (40)  can  be  written  as 
tr  { «rZ0«X }  =(2*j )~ '  tr  {(?/'"_  [rf-(  A -K^)]' 1 

BZ$T[-sl-(  4  -KgC)]' rdr  J  (41) 

However. 

(s/-(  . 4 -ATgOl  ' =  [/-(s/- 4  *KgC)  lK^](s/-4)'’ 

»[/-f7(r)C]<r/-4)  ’  (42) 

Substituting  (42)  into  (41)  gives  the  first  term  on  the  nght-hand  side  of 
(38)  Then  (35)  implies  (38)  is  mint  mired  over  •X.  which  establishes 

f  (  H0.  X,9)*£(  H0,  Ig,9)  (43) 

Now.  aasume  Z  is  fund  and  0g  satisfies  (36).  (37)  Subsutuung(39)  into 
(36)  and  manipulating  the  trace  gives 

tr  {  *,rZ0*g»  >  ■  tr  [  Q  f%‘  4  -  **  "Kr/BK^e'  4**'T*}  (44) 

Again  uang  Paneval's  theorem,  we  obtain 
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tr  {  KgZ0K09)  =(2irj  f '  tr  {  Q /'*  l*/-<  -4  "  ATgC))' ' 

^09ACg  [  -  sl-(A  -ffgC)]"  rdr| 

=(2*j)',tr[e/^“  H(r)9H(-r),'di}  (45) 

Thus.  (36)  implies  (38)  is  minimized  over  X  which  establishes 

S(ff0,Z.9)<S(ff0;Z,90).  (46) 

Inequalities  (43)  and  (46)  together  with  the  aforementioned  separability  of 
(38)  imply 

S(H0;Z.9)«;S(//0;Z0,90).  (47) 

The  upper  inequality 

S(Hg;Z0,90)cS(H;Z0.90)  (48) 

follows  trivially  from  the  fact  that  H0  is  the  minimum  mean-square  error 
estimator.  Thus.  ( H0,  Zg, 90)  is  a  saddlepomt  for  (32)  and  (33)  and.  by 
Theorem  4,  also  a  solution  for  (32)  and  (33). 

Necessity:  Suppose  (ff0,  Zg,90)  satisfies  (34).  By  Theorem  4. 
(//0.Zg,90)  solves  (33).  Hence  ff0  is  the  transfer  function  of  the 
Kalman- Bucy  filter  Let  F(Z.9)  =  £{c,c;r}  when  (Z.9)  are  the  second- 
order  statistics.  Then  F(Z,  0)  is  given  by 

0=( 4  -  Af0C)/>( Z.  9)  + />(  Z.  9)(  4  -  ATgC) BZflr + Ar0eAf0r 

(49) 

Thus,  the  difference 

AF-  />(Z.9)-F(Z0,9g)  (50) 

is  given  by 

O=(A-K„C)AF  +  .iFM-KoC)V*(Z-Z„)Br-v*o<0-0o)A:„,\ 

(51) 

Thus,  A  F  is  given  by 

AF  =  f~e'4-K*»[B(Z-Z0)BT  +  K<,{9~9',)Kf]e'4-*'*'r'Jt 

(52) 

By  (34) 

,r  (CdF)  =  £(  W0;  Z,9)-£(  f/0;  Zg.90)«0  (53) 

After  substituting  (52)  into  (53).  using  a  simple  trace  identity,  and  using 
(he  definition  of  Z„  ((39)  and  (37)|.  we  obtain 

tr{Zg(Z-Zg)}  +tr{Z0(9-90)}<0  t'leiX.  9e“V  (54) 

In  particular.  Z  =  Zg  implies  (36)  and  9  =  90  implies  (35)  C 

Thus,  we  see  that  conditions  (35)— (37)  are  equivalent  to  the  existence  of 
a  saddlepomt.  If  such  a  saddlepomt  exists  then  the  mini  max  filter  is 
simply  the  Kalman- Bucy  filter  for  the  <Ig,90)  pair  which  satisfies 
<35)-(37)  This  result  can  be  used  to  establish  the  existence  of  a  saddle 
point 

Theorem  4.  If  'X  and  X  are  convex,  compact  subsets  of  Sm‘m  and 
S’",  respectively,  then  a  saddlepomt  solution  for  the  mini  max  problem 
(32)  exists. 

Proof:  The  proof  shows  that  a  soluuon  to  the  maximin  problem  (33) 
exists  and  satisfies  conditions  (35)-(37)  of  Theorem  5  By  (26)— (31). 

^min  £((f;  X.9)»tr {(?F(I.9)J  (55) 

is  continuous  in  X  and  9  Since  X  and  'X  are  compact,  a  solution  to  (33) 
exists.  Let  ( f7.  Xg,  9g )  be  such  a  solution. 
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Since  S(  H:  2,8)  is  concave  in  2  and  8  for  every  HEX  *  (see  (26)], 
minHe5[-S(W;  2.8)  is  also  concave  in  2  and  8.  Hence,  (lie  Frtchet 
differential  of  (55)  must  be  nonpositive  in  every  direction  into  the  set 
XxX.  Let  8P{ 2„,80,;  A2, A8)  denote  the  Frtchet  differential  of  P 
evaluated  at  ( 20.8,5)  in  the  direction  (A 2.  A8)  By  (29). 

0  =  (A-KoC)8P  +  SP{A-KoC)T+B±ZBr->-KnMKZ  (56) 

where  the  dependence  of  8P  on  2.  8.  .12,  and  A8  has  been  suppressed. 
Thus,  8P  is  given  by 

SF=  SA2flr  +  Af0A8  Af0r]el',_Ar°c’r'dr.  (57) 

Using  (57)  and  a  simple  trace  identity,  the  Frtchet  differential  of  (55) 
becomes 

tr  {0«F(2o.  80;  A2,  A8) }  =tr  {  BtZ0BClZ)  +  tr  { Af0rZ0/f0A8} 

(58) 

where  Z0  satisfies  (37). 

Consider  an  arbitrary  point  (2.8)  eXxX.  Since  X  and  01  are  convex, 
the  line  segment  joining  ( 2,, 80 )  and  (2,8)  is  in  XxX  and  hence, 

(A2.A8)  =  (2-20,8-80)  (59) 

is  a  direction  into  X  x  X  Substituting  (59)  into  ( 58)  and  requiring  ( 58)  to 
be  nonpositive  gives 

tr{BrZ0fl(2-2„)}-tr{8f0rZ0Ar0(8-e0)}<0.  (60) 

The  choice  (2,8)=(2,80)  in  (60)  gives  (35)  while  the  choice  (2,8)  = 

( 2,.  8)  in  (60)  gives  (36).  Thus,  by  Theorem  5.  (2o, 80)  is  a  saddlepoint 
for  (32).  □ 


Then  each  set  has  a  maximal  element 

*o  =  £n  +V  (63) 

80=8*+t )/  (64) 

and  the  minimax  niter  is  the  Kalman- Bucy  filter  corresponding  to 

(io*80).  O 

Another  reasonable  model  for  uncertainty  in  the  second-order  a  prion 
statistics  of  system  (I)  and  (2)  is  that  the  spectral  density  matrices  are  a 
convex  combination  of  two  possible  nominals  Let 

X=  {2:  Z=AZ,  +  (I-A)22;  2,  >0,  Z2  >0,  Ae[0,  l)}  (65) 

X  =  (8:  8  =  y8|  +  (1  -y)82;  8,  >0,  82  >0,  y€[0, 1]}.  (66) 

Suppose  that  (2o,80)  satisfy  (35)— (37)  and  let  A0  and  y0  be  the  corre¬ 
sponding  constants  defined  by  (65)  and  (66).  Then  (35)  and  (36)  become 

(A-\0)tr{BrZ0B(2,-22)}<0  (67) 

(y-Yo)tr{Ar0rZoA-o(8l-e2)}<0.  (68) 

Conditions  (67)  and  (68)  imply  that  (35)  and  (36)  can  be  satisfied  with 
strict  inequality  for  all  Z^Zo  and  8#80  only  if  2g  and  80  are 
endpoints  of  the  set.  Otherwise,  we  must  have 

tr{«rZ0fl(Z,-Z2)}=0  (69) 

tr  {KoZ„Ar0(8|  -82)}  =0  (70) 

where  Z0  and  K0  depend  on  A0. 

IV.  Summary 


C.  Discussion 

Theorem  5  provides  an  equivalent  characterization  of  a  saddlepoint  in 
terms  o l  quantities  associated  with  the  Kalman  filter  of  a  particular 
( 20, 8„ )  pair.  Theorem  6  establishes  the  existence  of  a  saddlepoint  when 
X  and  X  are  convex  and  compact.  Taken  together.  Theorems  5  and  6 
show  that  the  minimax  filter  which  solves  (33)  when  X  and  X  are  convex 
and  compact  is  given  by  the  Kalman  filter  corresponding  to  the  ( 2, ,  80 ) 
pair  which  maximizes  tr  {(?/>( 2. 8)). 

There  are  two  important  cases  for  which  the  conditions  (35)-(37) 
provide  explicit  solutions.  The  first  occurs  when  the  set s  X  and  X  have 
maximal  elements.2  and  the  second  is  when  the  sets  X  and  X  are  each 
line  segments. 

Theorem  X  Suppose  2,  eX  and  80  eX  are  maximal  elements  in  their 
respective  sets  Then  2^  and  80  satisfy  (35)— (37). 

Proof  By  the  controllability  and  observability  aasumpuons  on  the 
system  (1)  and  (2).  the  closed-loop  filter  matrix  (A-K^C)  is  asymptoti¬ 
cally  stable.  Hence,  Z0  defined  by  (37)  is  positive  definite.  The  maxunality 
of  2q  and  80  thus  implies  (35)— ( 36).  Q 

Theorems  5  and  7  combined  imply  that  if  the  sets  X  and  X  each 
contain  a  maximal  element  then  the  Kalman-  Bucy  filter  corresponding  to 
those  elements  is  a  immmax  filter.  This  result  is  a  precise  expression  of 
the  intuitive  design  procedure  of  using  the  worst-case  noises.  It  generalizes 
the  concept  of  a  bounding  filter  (see  Nahi  and  Weiss  (I4|)  to  the 
multivariable  estimation  problem.  One  application  given  by  the  following 
example  demonstrates  the  intuitive  nature  of  the  result. 

txampk  A  common  method  of  modeling  uncertainty  in  the  second 
order  a  pnon  statistics  of  system  ( I )  and  (2)  is  to  choose  a  nominal  pair 
1 2V .  8V )  and  assume  that  the  true  X  and  8  differ  from  the  nominal  in 
norm  by  no  more  than  some  posiuve  constant  g.  Assume  that  the  norm  to 
be  used  for  each  matrix  is  the  norm  induced  by  the  Euclidean  vector  norm 
on  the  underlying  space  1  Hm  and  X'.  respectively).  Define 

Xi  (Z  '!X-X¥.!«e.X>0)  (61) 

X  *  (8:  il8-8vil«e,8>0)  (62) 

'A  Mual  iliwnm  a  femeth  4  «"■■■  stiau.ii  mk*  that  il.tt 
*•«*»•  Haw  e»  ««asn«a  i»i  o  Ssftsa*  «a  «- »  Sana  »»«*»  am4ihwu 


Minimax  approaches  to  two  problems  associated  with  the  estimation  of 
the  state  of  a  linear  stochastic  system  have  been  considered.  The  scalar 
problem  for  which  the  noise  spectrum  is  uncertain  was  examined  in 
Section  II.  The  minimax  filter  for  this  problem  was  characterized  in  terms 
of  the  least  favorable  noise  spectrum.  Section  III  considered  a  similar 
problem  for  the  multivariable  estimation  problem  when  the  noises  are 
white  processes  with  unknown  a  prion  second-order  statistics.  The  saddle¬ 
point  solution  for  this  problem  is  given  by  a  Kalman- Bucy  filter  for  a 
least-favorable  pair  of  componentwise  correlation  matrices. 
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SPECTRAL  UNCERTAINTY  ON  WIENER  FILTERING 
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Abstract 


Results  from  an  extensive  study  of  the  performance  of  Wiener  filtering 
under  spectral  uncertainty  are  presented.  For  a  variety  of  spectral 
uncertainty  models  the  Wiener  filter  is  shown  to  have  unacceptable  sensi¬ 
tivity  to  even  small  deviations  from  those  signal  and  noise  spectral 
densities  which  were  used  to  design  the  filter.  Performance  of  a  robust 
filter  (designed  to  have  the  best  possible  performance  when  the  uncertainty 
is  worst)  is  also  examined.  In  most  cases  the  robust  filter's  insensi¬ 
tivity  to  spectral  uncertainty  makes  it  preferable  to  the  traditional 
Wiener  filter. 
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1.  Introduction 

The  solution  to  the  traditional  stationary  linear  (i.e.,  Wiener) 
filtering  problem  requires  exact  knowledge  of  the  signal  and  noise  spectra. 
Often  in  practice  it  is  unrealistic  to  assume  such  knowledge.  Despite  this, 
Wiener  filters  are  widely  used  for  steady-state  filtering.  In  this  paper 
we  consider  the  performance  of  Wiener  filtering  when  the  signal  and  noise 
spectra  differ  to  a  small  degree  from  those  assumed  in  the  design  process. 

In  particular,  in  Section  2  we  consider  the  Wiener  filter  for  a 
particular  signal  and  noise  spectral  pair  which  would  be  natural  to  assume 
is  the  true  spectral  pair.  We  then  look  again  at  our  circumstances  and 
model  the  uncertainty  we  might  have  about  our  choice  of  spectra.  In  so 
doing  we  find  that  the  potential  exists  for  totally  unacceptable  perfor¬ 
mance  degradation  in  the  presence  of  even  small  degrees  of  uncertainty. 

In  Section  3  we  consider  filters  termed  "robust".  These  filters  are 
designed  to  have  the  best  "worst-case"  performance  over  uncertainty  classes 
of  spectra.  The  method  of  design  is  due  to  Poor  (1980)  and  was  based  on 
the  work  of  Kassam  and  Lim  (1977) .  As  we  will  see,  the  advantage  of  these 
robust  filters  is  that  they  are  least  sensitive  in  the  sense  that  they  have 
the  smallest  possible  maximum  deviation  from  optimality  within  the  constraints 
Imposed  by  our  uncertainty. 

Of  course  there  is  a  trade-off  involved  in  robust  filtering.  While 
the  robust  filter  has  better  worst-case  performance,  we  cannot  expect  it 
to  have  optimal  performance  should  our  original  choice  of  spectra  be  the 
true  ones.  In  Section  3  we  will  consider  this  trade-off  as  well. 
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2.  The  Seni-ctiv-ctu  jj  tie  'JlieneA  f-iZte/i  to  SpecVuU.  UnceAXainty 

The  mean-square-error  (MSE)  for  linear  filtering  of  a  signal  in  uncorrelated 
additive  noise,  where  both  signal  and  noise  are  modeled  as  real,  zero-mean, 
second-order,  wide-sense  stationary  random  processes,  is  given  by 


e(o,v;H)  -  J[o(ut)  |  l-H(ui)  J  +  v(co)  |  H(w)  |  ]  dw  , 


where  H  is  the  transfer  function  of  the  filter  and  o  and  v  are  the  power 
spectral  densities  (PSD's)  of  the  signal  and  noise,  respectively.  For  a 
fixed  signal  and  noise  spectral  pair,  (o,v),  e(c,v;H)  is  minimized  by 
the  Wiener  filter 


H*(u) 


_ _ 

a(u)  +  v(u) 


and  the  minimum  MSE  is 


e  (o,v)  A  e(a,v;H*)  ■  r  H*(w)  v(w)  du>  .  (3) 

ZTT  U 
-.00 

Unfortunately,  as  we  discussed  in  the  introduction,  it  is  often  the 

case  in  practice  that  our  knowledge  of  the  signal  and/or  noise  PSD's  is 

* 

inexact.  If  the  a  and  v  we  choose  for  designing  H  are  not  the  true  spectra, 
then  our  filter  will  generally  have  less  than  optimal  performance.  To 
illustrate  the  degree  of  performance  degradation  that  can  result  from  such 
mis-modeling,  we  consider  the  following  examples.  The  numerical  results 
presented  here  and  in  the  following  section  comprise  a  representative 
selection  from  an  extensive  numerical  study  (Vastola,  1981). 

The  p-pcint  C&L44 .  For  a  number  of  applications  it  is  natural  to 


i 
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asstime  Chat  we  have  a  narrow-band  first-order  Markov  signal  in  wide-band 
first-order  Markov  noise,  i.e.  that 


and 
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2  2 

where  are  the  3  dB  bandwidths  and  vg  and  v^  are  the  powers  of  the 

signal  and  noise,  respectively.  For  Fig.  1  we  have  a  ■  10  and  a  ■  1. 

N  o 

In  the  figures  of  this  paper  we  have  used  a  measure  of  performance 
which  we  refer  to  simply  as  output  signal-to-noise  ratio  (SNR) .  The  purpose 
of  Wiener  filtering  is  to  minimize  the  MS£,E{[S(t)  -  S(t)]2}, 
between  our  estimate  S(t)  (i.e.  the  output  of  Che  filter)  and 
the  actual  signal  S(t) .  Since  the  output  of  the  filter  can  be  written  as 
S(t)  +  (S(t)  -  S(t)),  we  use  the  signal  power  divided  by  the  MSE  as  an  output 
SNR.  For  the  purpose  of  our  graphs  we  translate  this  to  dB.  The  horizontal 
axis  is  10  log1Q(10v2/v2) ,  the  input  SNR  in  dB. 

* 

The  top  line  in  Fig.  1  gives  the  performance  of  the  Wiener  filter  Hq, 
designed  using  oq  and  mq  of  (4)  in  equation  (2), when  oq  and  vq  are,  in  fact, 
the  signal  and  noise  spectra  which  occur.  For  this  case  it  is  straight¬ 
forward,  via  equation  (3),  to  show  that 


where 


*. 

e  (a 


o’V/vS 


5aN//asaJ 


2,2  w 
r  +  asV(a 


Sr  +  V 
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Now,  suppose  that  the  only  information  about  which  we  are  certain  is 
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2  2 

ch«  powers  of  che  signal  v~  end  Che  noise  and  chac  we  have  estimated 

wich  sufficient  accuracy  che  fraccional  power  of  aach  on  che  sec 

S  ■  {■»  real!  j.  <,  1;.  Ve  denoce  che  signal  and  noise  fraccional  powers  by 

-1  2 

p$  and  ?N,  raspeceively  (e.g.  (2^)  /g  o(w)du  -  p^) .  In  parcicular,  for 

che  exaaqile  considered  above,  we  have  pe  ■  .5  and  pu  •  .063.  If  Chase 

cocal  powers  and  fraccional  powers  are  all  we  can  really  be  certain  of,  we 

would  like  co  know  how  badly  che  performance  of  H*  can  deteriorate.  The 

o 

bottom  line  in  Fig.  1  gives  che  worst-case  performance  of  H*.  The  middle 

o 

line  represents  what  we  can  do  trivially  for  any  pair  of  spectra  by  using  an 
all-pass  filter  (H  =  1)  when  che  input  SNR  is  positive  and  by  using  a  no¬ 
pass  filter  (H  =  0)  when  che  input  SNR  is  negative.  Thus  we  see  chac  if 
che  spectra  are  actually  first-order  Markov  then  our  filter  does  well,  but 
if  not  we  can  do  ^gni^canttu  wonML  than  CUvicU.  ^iZtz’u.ng . 

Finally  we  noce  chat  uncertainty  classes  of  spectra  given  by  assuming 
exact  knowledge  only  of  the  total  and  fractional  powers  are  called  p-point 
classes  and  have  been  studied  as  models  of  spectral  uncertainty  by  Cimlni 
and  Kassam  (1980) .  An  analogous  uncertainty  class  for  probabilities  used 
in  robust  hypothesis  testing  and  robust  detection  has  been  examined  by  the 
authors  (Vaatola  and  Poor,  1980)  and  by  El-Savy  and  VandeLinde  (1977,  1979). 
The.  e- contamination  ctcU4.  Suppose  chac  we  again  have  a  particular 

spectral  pair  (o  ,v  )  which  we  believe  to  be  the  true  spectra,  but  that  we 
o  o 

also  have  a  general  sense  of  uncertainty  about  our  choice  which  we  model  by 
an  e-contaminated  class;  i.e.,  we  assume  we  know  that  che  true  spectra 
satisfy  (c,v)  *  \  where  0  <  e  <  1, 
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i  _  •  -jj  j(uj)  •  (i-e)o  (ui)  *  cj'Ojb)  1R 


•  * 

,  j ff'(w)  dw  ■  y*30(“)  <*"/ 


and 


?!.  *  H  v(w)  -  (l-c)vo(u)  +  cv'iut )  u«TR,  J’v’C'm)  du  •  ^  v^(u)  dw) 


Classes  of  this  fora  have  basn  used  extensively  as  general  node Is  of  uncer¬ 
tainty  (Tukey,  1960;  Huber,  1965;  Kassam  and  Lin,  1977;  Hosoya,  1978). 

* 

Fig.  2  gives  the  performance  of  the  Wiener  filter  Hq  designed  via 

equation  (2)  assuming  a  narrow-band  (o^  ■  1)  first-order  Markov  signal  in 

wide-band  (a^  -  1000)  first-order  Markov  noise.  The  upper  line  gives  the 

performance  of  this  filter  when  these  are  the  true  signal  and  noise.  The 

lower  line  is  the  worst  esse  of  this  filter  over  the  uncertainty  classes 

in  (5)  with  a  and  v  given  by  the  above  choices  and  withe*  .1.  We  see 
o  o 

that,  for  values  of  input  SNR  near  zero,  the  worst  case  is  better  than  trivial 

filtering  but  still  much  worse  than  optimal  (about  8.5  dB) ;  for  values  of 

input  SNR  greater  in  absolute  value  than  60  the  performance  in  both  the 

nominal  and  worst  cases  is  the  same  as  trivial  filtering;  and  for  all  other 

values  the  worst  case  is  worse  than  trivial  filtering. 

An  c-contaminatzcL  signal  in  whitz  noibZ.  Fig.  3  shows  the  nominal 

and  worst  case  performance  of  the  nominal  Wiener  filter  for  the  signal 

uncertainty  class  in  (5)  with  t  •  .1  and  oq  first-order  Markov  with 

*s  -  1.  The  noise  is  white  noise  with  no  uncertainty  and  the  horizontal 

2 

axis  is  actually  the  ratio  of  signal  power  v^  to  the  noise  level  Kq/2.  Rote 
that  the  worst  ease  is  bounded  above  by  10;  in  fact,  for  any  choice  of  e, 
it  is  bounded  above  by  10  log(10e). 

As  noted  above,  the  optimal  and  worst-case  performance  of  Wiener 


filtering  under  various  conditions  hss  been  examined  extensively  for  several 
uncertainty  models  and  for  a  variety  of  signal  and  noise  parameters 
(such  as  bandwidth  and  power).  The  above  examples  are  representative  of 
the  sensitivity  of  Wiener  filtering  to  deviations  from  spectral  assumptions 
which  were  found  in  virtually  every  case. 

3.  Robui-f  W-teneA  Ft ttiM 

To  remedy  the  problems  of  Wiener  filtering  sensitivity  discussed  In  the 
preceding  section,  we  consider  the  following  robust  filter  design  which 
was  developed  by  Poor  (1980)  based  on  the  work  of  Kassam  and  Lim  (1977). 

A  moat-robust  Wiener  flltar  (Poor,  1980)  is  a  solution  to  the 

game 

min  sup  e(o,v;H)  (6) 

H  (^,v)«^x7I 

where  J  and  ?!  are  classes  of  spectra  representing  uncertainty  in  the  signal 

and  noise, respectively,  and  where  e(<?,v;H)  is  given  in  (1).  Note  that 

* 

since  the  sup  in  (6)  gives  the  least  upper  bound  on  the  error,  HR  is  a 

a 

filter  with  the  smallest  possible  such  upper  bound.  In  other  words  HR  is 
least  sensitive  to  worst  case  uncertainty. 

A  pair  of  spectra  iS£i£  favorable  for  Wiener  filtering  for 

the  spectral  uncertainty  classes  J  and  V  (Poor,  1980)  if 

e(o,v;E^)  <.  •(sl,vL’^L>  ^ 

for  all  a  <  v  c  7[  where  h£  Is  the  Wiener  filter  for  the  pair  (s^.v^) 
as  in  (2). 

It  is  straightforward  to  see  that  if  (o, ,v^)  t  J  x  71  is  least  favorable 


for  Wiener  filtering  for^  and  "l  then  the  pair  ia  a  saddle-point 

solution  to  the  mini max  game  (6).  That  is. 


it  it 

sup  e(a,v;HT)  -  e(aT,vT;H-)  »  min  e(aT,vT;H).  (8) 
(e.v)«*>x*  L  L  L  -L  H  L  L 


We  see  from  this  that  if  (o^,^)  is  least  favorable  then 
Wiener  filter. 

Thus  we  see  that  if  we  can  find  a  least-favorable  pair  then  we  can 

design  a  most-robust  Wiener  filter.  One  of  the  methods  developed  in 

(Poor,  1980)  for  finding  least  favorable  pairs  of  spectra  (and  hence  most- 

robust  filters)  Involves  an  analogous  concept  in  hypothesis  testing:  least- 

favorable  probability  density  functions  (PDF's)  for  testing  one  set  of  PDF's 

against  another.  Least-favorable  PDF's  have  been  found  for  a  variety  of 

classes  of  PDF's  (Huber,  1965;  Kassam,  1981;  Vastola  and  Poor,  1980).  If 

2 

every  signal  spectrum  in  J  has  the  same  finite  power  vg  and  every  noise 

2 

spectrum  in  7[  has  the  same  finite  power  v^  then  we  can  define  classes  of 
PDF's 

2 

*s  ■  {fg|fg(w)  ■  o(u)/2irVg,  oet/} 

and 

9^  "  "  v(w)/2vv2,  veTD 

and  possibly  apply  the  following  (Poor,  1980,  Corollary  1). 

2  2 

Theorem.  If  tf  and  71  are  convex  and  have  constant  powers  v^  and  v^,  respect¬ 
ively,  and  qs  t  9$  and  e  9^  are  least-favorable  PDF's  for  9 g  versus  0^ 
then  oL  ^  2Trvsc,S  VL  ^  2rVNqN  *r*  laaat  favorable  spectra 
for  Wiener  filtering  for  tf  and  71. 

This  theorem  allows  us  to  construct  most-robust  Wiener  filters  for  the 


is  a  most-robust 


V 
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first  two  examples  considered  in  Section  2. 

The.  p-point  ctxii .  It  can  be  seen  from  the  above  theorem  and  Vastola 
and  Poor  (1980)  or  from  Cimini  and  Kassam  (1980)  that 


<<-> 


- 2 - 2  for  “  *  s 

I  PSVS  +  PNVN 

(1_ps)vs  c 

- s - ~  for  a)  c  S 

(1’PS)VS  +  (1~PN)VN 


and ,  hence , 


e 


(o,v;h£) 


PSPH  .  <l‘ps>  (1‘V 
Psr  +  pa  +  U-ps)r  +  <l-pN)  *«  *u  fe-P) 


A  2  2 

where  r  ■  vs^vN’  *nPuC  SNR.  In  Fig.  4  we  have  superimposed  onto  Fig.  1 
the  performance  of  HR  (the  middle  line).  It  is  clear  from  Fig.  4  that, 
unless  we  are  extremely  certain  about  our  choice  of  a  and  v,  H*  is  prefer- 

K 

* 

able  to  H  . 

o 

Thu  e-contamirurtzd  C&144.  For  the  classes  in  (4)  it  can  be  easily 
seen  from  the  above  theorem  and  Huber  (1965)  that 


<(“) 


k’  -  e’ 

(  b!<»> 


c'r/(c'r  +  1)  for  H  (ui)  <.  k' 

o 


for  k’  <  H  (u>)  <  k" 
o 


|  k"  a  c"r/(c"r  +  1)  for  H*(w)  >  k”  , 

where  0  i  c’  <  c"  i  -  are  constants  given  in  Huber  (1965).  It  is  interesting 
to  note  that  the  robust  filter  H*  has  this  same  form  for  several  other 
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* 

uncertainty  models  (Poor,  1980;  Huber,  1977).  Also  note  that  this  H^  will 
not  have  constant  MSE  over  J  *  n  as  in  Che  previous  example.  In  Fig.  5 
we  have  superimposed  onto  Fig.  2  the  performance  of  H*  when  the  true  spectral 

K 

pair  is  (a0>vQ)  (the  second  line  from  the  cop)  and  when  the  true  spectral 
pair  is  (g^.v^)  (the  third  line  from  the  top).  Recall  from  the  definition 
of  (cL»vL)  that  the 
example  c1  ■  1/c"  *  .125. 

Unlike  the  preceding  example,  the  preferability  of  the  most* robust  filter  is 

not  so  clear-cut.  If  one  were  relatively  certain  about  (<7q»Vq)  being  correct  then 
* 

Hq  would  be  the  better  choice ;  however,  if  not,  and  if  the  guaranteed  level  of  performance 
over  Jx  7 \  (given  by  the  third  line  down)  were  adequate,  we  would  likely  choose  H^. 

An  s- contaminated  -iignat  -in  white,  noi&e.  Clearly  the  above  theorem 
cannot  be  applied  to  find  a  robust  filter  in  this  case  since  the  noise  has 
infinite  power;  however  a  more  direct  approach  proves  fruitful  here.  First, 
we  may  restrict  our  search  to  He  L^tdu),  the  mean-square  integrable  functions 
on  R,  the  real  line,  since  all  others  have  infinite  MSE  regardless  of  what 
j  is  (cf.  equation  (1)).  Second,  we  have,  for  all  He  l^du). 


latter  is  the  worst-case  performance  of 


For  this 


sup  e(o,vo;H)  -  sup  ±  J  [  |  l-H(to>  |  "  ( (1-c  )aQ  (u»)  +  ea'(u>))  +  |H(u,)|'-~l  ^ 


2  * 


eta/ 


auf. 


!((l-t)oo.vo;H)  +  e  sup  -jj-  J  j  1-H(u») |  *(«*»)  du 


1R 


-  e((l-«)o  ,v  ;H)  +  ev 
0  o  a 


The  last  step  is  true  because  o'  is  assumed  to  be  integrable  and  H  c  l^du). 


Hr 

Clearly  this,  equation  (2),  and  (6)  (the  definition  of  H  )  imply  that 

(l-E)Oo(u.) 

aR(Ui)  “  <l-£)0  (u»)  +  N  /2 

o  o 

Recall  thac  Fig.  3  shoved  the  performance  of  H*  in  this  situation  in 

its  nominal  and  worst  cases.  If  we  superimposed  the  nominal  and  worst 
* 

cases  of  onto  Fig.  3,  as  we  have  done  for  the  other  examples,  we  would 

find  no  change;  i.e.,up  to  the  accuracy  of ' the  graph  the  nominal  cases  and 
*  * 

worst  cases  of  Hq  and  are  the  same.  In  fact,  they  differ  by  no  more  than 
.01.  It  should  be  noted  that  this  is  a  singular 

example  and  the  unusual  performance  is  due  to  the  infinite  power  of  the  white 
noise,  not  to  the  "very  wide  bandedness"  which  white  noise  is  generally  used 
to  model. 

IV.  ViAOJAAian  and  ConchuA-ianA 

As  we  have  discussed  above,  the  results  presented  in  this  paper  (with 
the  one  exception  of  the  white  noise  example)  are  representative  of  our 
findings  in  a  wide  variety  of  cases.  For  example,  although  it  is  a  much 
harder  case  to  solve,  we  have  developed  numerical  results  for  causal  Wiener 
filtering  of  an  e-contaminated  first-order  Harkov  signal  in  first-order 
Markov  noise.  The  theory  of  the  causal  case  has  not  been  developed  in  the 
same  generality  as  the  noncausal  case;  however,  this  specific  example  can  be 
treated  using  the  results  of  Poor  (1980)  and  Yao  (1971).  In  Fig.  6  we  have 
presented  the  results  for  this  causal  filtering  example  with  e  -  .1,  •  1 

and  *  1000.  For  comparison  we  have  also  Included  Fig.  7  which  gives  the 
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results  for  the  corresponding  noncaussl  case.  Note  the  similarity  between 
the  two  figures.  Again,  this  is  indicative  of  our  findings  over  a  wide 
range  of  signal  and  noise  bandwidths  and  c's. 

Other  situations  we  examined  in  the  noncausal  case  include  ones  with 

3  2  2  2  2 

J  and/or  v  as  second-order  Maritov  (i.e.  having  the  form  4a  v  /(a  +  m  )  ) 
or  using  bendliaited  white  noise.  The  results  for  all  these  cases  were  very 
similar  to  those  already  presented  (e.g.  Fig.  3). 

Of  particular  interest  is  the  case  of  an  e-contaminated  first-order 
Markov  signal  in  e-contaminated  bandlimited  white  noise.  Even  when  the 
bandwidth  of  the  noise  was  extremely  large  (e.g.  10^)  the  results  were  similar 
to  the  other  cases  and  unlike  those  Involving  nonbandlimited  white  noise 
(cf.  the  remarks  at  the  end  of  Section  3). 

In  summary,  the  Wiener  filter  can  be  undesirably  sensitive  to  small 
deviations  from  assumed  spectral  models.  Furthermore,  while  there  are 
enough  specific  cases  to  the  contrary  to  make  caution  advisable,  we  have 
found  for  a  wide  variety  of  situations  that,  when  spectral  uncertainty 
exists,  the  robust  Wiener  filter  is  generally  preferable  to  the  traditional 
Wiener  filter. 
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ABSTRACT 

A  alnlaax  formulation  of  tba  problam  of  de¬ 
signing  dlscrece-cima  sauochers,  flltart  and 
pradlctora  whan  knowledge  of  tba  signal  and  noisa 
speccra  la  laaxact  la  praaantad.  A  raault  la 
glvan  which  convarta  Che  minima*  problam  to  a 
maximization  problam.  Explicit  solutions  ara 
glvan  for  tha  casa  of  a  contaadnatad  wida-sensa 
Markov  signal  In  white  noise.  Tha  results  of  a 
numerical  study  of  the  inherent  crada-off  In* 
volved  In  robust  filtering  ara  given  for  this 
case  and  the  preferability  of  the  robust  filter 
is  demonstrated  for  a  wide  range  of  input  SNR' s 
and  signal  bandwidtha. 

1.  INTRODUCTION 

In  tha  traditional  formulation  of  tha  linear 
d in  inane-mean- square -error  (MMSE)  signal  estima¬ 
tion  problam.  If  tha  signal  and  noisa  processes 
ara  uncorralatad  with  each  other,  tha  solutions 
to  both  the  causal  and  noncausal  versions  of  this 
problam  depend  only  on  cha  power  spectra  of  tha 
signal  and  noise.  In  practice,  however,  the 
spectral  properties  of  cha  signal  and/or  noise 
may  not  be  known  with  complete  certainty.  To 
account  for  such  uncertainty  several  studies  have 
investigated  possible  designs  of  robust  filters. 
Robust  filters  are  filters  which,  while  not 
necessarily  performing  as  well  as  the  optimal 
filter  when  the  signal  and  noise  spectra  are  as 
anticipated,  do  not  experience  a  significant  de¬ 
crease  in  relative  performance  over  soma  class 
of  spectra  "near"  Che  anticipated  pair.  On  the 
ocher  hand  the  so-called  optimal  filter  (l.a., 
tha  one  designed  based  on  the  anticipated  model) 
may  experience  draswtic  degradation  of  perfor¬ 
mance  under  small  deviations  (see  (}]). 

So  far  cha  chruac  of  this  research  has  been 
for  continuous-time  filtering  ((1J,(2)).  In  this 
paper  we  consider  cha  discrete-time  case  for 
causal  linear  signal  estimation  (which  Includes 
sanoching,  filtering,  and  prediction).  In  Sec¬ 
tion  2  we  present  a  formulation  of  tha  robust 
linear  causal  discrate-clam  signal  estimation 
problem  and  a  mlnlmax  type  theorem  which  yields 
a  general  design  approach  for  these  robust  signal 
estimators.  This  formulation  is  analogous  to 
that  presented  In  Section  II  of  (21  for  contin¬ 
uous-clan  robust  filtering.  In  Section  3  we 


consider  specific  models  of  signal  and  noise 
and  present  some  numerical  results.  Finally  In 
Section  4  we  discuss  these  results  along  with 
sosm  possible  copies  for  further  study  in  this 

area. 

2.  THE  GENERAL  FORMULATION 

Throughout  this  paper  we  denote  the  sat  of 
integers  by  Z  and  we  assume  Chat  we  observe  a 
realisation  fy(k)|k€Z]  of  a  random  process 
[  Y(k)  |  k€  Z]  given  by 

Y(k)  «S(k)+N(k),  k  €  Z,  (2.1) 

where  [S(k)|k€Zj  and  (N(k)|k€Z)  are  wlde-sense 
stationary  random  processes  which  are  uncorre¬ 
lated  with  each  other.  Ue  denote  their  spectral 
meaauras  by  ms  and  m^,  respectively.  MS  also 
assume  that  tha  mean' of  {N(k)|k€Z}  Is  taro. 

Our  purpose  is  to  fora  a  linear  estimate  of  a 
linear  function  of  [s(k)}  (tha  signal)  from 
{y(k>}  (the  observation). 

By  a  spectral  measure  we  mean  specifically 
a  nonnegative  Boral  measure  (l.e.  a  countably 
additive  measure  on  the  Boral  sacs,  sea  [4])  on 
U,  cha  unit  circle  of  tha  complex  plane.  Given 
such  a  spectral  measure  m  we  denote  by  HL(dm) 

Che  Hardy  subspace  of  L2(dm),  which  is  cha  sub¬ 
space  spanned  by  [e^|n  -  0,1,2,...}  [4],  For 
the  estimation  problam  discussed  above 
H2(d  (mg  +mn) )  is  Che  space  of  all  mean-square 
lncagrable  causal  transfer  functions,  while 
L2 (d (mg  + mu) )  also  Includes  the  noncausal  ones. 

Let  D(8  )  represent  a  "desired"  linear  oper¬ 
ation  on  the  signal  process  (S(k)};  for  example 
DQ)  •  a'1-®  represents  smooching  for  n  <  0, 
filtering  for  n  •  0  and  prediction  for  n  >  0. 

For  each  pair  of  speccral  measures  (mg, an)  and 
craasfar  function  K€  L2  (d (mg  +  m^ ) )  the  mean- 
square  error  eD(dmg,4e!);H)  is  given  by 

,  TT  , 

(2")  j  I  DO)  -H<8)|  dms(p) 

•TT 

(2.2) 

i  n  1 

*  (2rr)*1  j-  |H(8)r  <K,@>  • 

•*T 

For  each  pair  (mj.a^)  cha  problem 
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min 

H  6  H^nj+mj,)) 


•o(d“s,d*NiH) 


(2.3) 


has  a  solution  H^  (ths  optimal  causal  transfer 
function)  which  is  uniquely  defined  a. a.  with 
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respect  to  s>s+my.  We  denote  by  a£(dmg,dmy)  tha 
optimal  value  aQ’<das>dav(iH' ) . 

Unfortunately,  thara  Is  no  general  expres¬ 
sion  for  eg  (dmg ,  toff) ,  however  for  a  wide  variety 
of  caaaa  tall  whan  D  and  tha  derivative  with 
respect  to  Lebeagua  measure  of  the  absolutely 
continuous  part  of  my  are  rational)  Snyders  (3) 
has  given  specific  expressions;  further  he  has 
given  a  systematic  method  of  finding  others. 

This  will  be  important  in  Section  3. 

Of  course,  this  traditionel  formulation  of 
dlscrete-tlan  causal  signal  estimation  assumes 
exsct  knowledge  of  the  signal  and  noise  spectra. 
As  we  leant ionad  in  Section  1  lc  la  often  un¬ 
realistic  to  assume  this  knowledge,  and  hence  it 
is  of  interest  to  design  a  robust  signal  ast lma- 
tor.  The  design  approach  we  consider  hare  is 
analogous  to  chat  of  robust  Wiener  filtering  as 
considered  by  Poor  [2]  in  a  formulation  bated 
on  the  work  of  Kasaam  and  Urn  [1]. 

Specifically,  we  assume  chat  the  signal  and 
noise  spectra,  mg  and  sty,  are  known  only  to  be¬ 
long  to  some  nonincersectlng  classes,  J  and  *?. 
That  is,  we  know  chat  mg  €  V  and  o^|  €  ?!  but  ua 
do  not  know  which  members  of  J  and  ?!  are  tha 
true  signal  and  noise  spacers. 

We  consider  a  transfer  function  Hr  to  be 
robust  for  J  and  ~  if  thara  is  an  upper  bound 
for  e<mg,im,;Ha)  over  ell  (mg.my)  S  J  x  ?f.  The 
adjective  robust  is  appropriate  since  this  upper 
bound  would  give  a  guaranteed  level  of  perfor¬ 
mance.  Ideally  we  would  like  co  have  the  beet 
such  transfer  function.  Thus  we  define  a  most 
robust  cauaal  cransfar  function  to  be  a  solution 
Hj  co  the  game 

min,  sup  a  (dm  .dm,; H)  (2,4) 
H€  K  (m^sySe-'X-!  D  3  ^ 

whara  h  •  H  €  H?(dB)|  sup  |h@)|  s  l]. 

In  an  effort  to  find  a  most  robust  causal 
cranafar  function  we  make  the  following  defini¬ 
tion. 

Definition:  A  pair  of  spectral  measures  (mg, my) 
is  least  favorable  for  cimaal  linear  estimation 
for  the  claaaet  J  and  *  if 


min  e  (dm^dm^H) 
h€  Kt  D  S  S 

■  sup 

(.s,ay€Vx*J 

Thu*  If  we  could  find  a  lent- favorable  pair 
(mg, my)  «  J*  ?!,  we  would  have  our  sought-after 
most-robust  estimator;  tha  optimal  causal  crans¬ 
far  function  for  (mg ,my) .  Tha  purposa  of  our 
main  thaoram  (below)  is  co  facilitate  the  search 
for  a  least-favorable  pair. 

Ue  now  view  J  end  ?!  as  subsets  of  the  Banach 
■pace,  3,  of  finite  Borel  measures  an  U  sod  con¬ 
sider  3  endowed  with  its  week*  topology  (see 
[10]).  A  sequence  {m,,}  converges  co  e  measure  m 
In  this  topology  If  f  dug  converges  to  *  f  dm 
0  U 

for  every  continuous  function  f  an  U,  (In  the 
probability  literature  this  la  usually  referred 
co  as  waak  conversance  of  (probability)  measures 

(see  [9]).) 


an(dm„.dm„;H,) 


(2.6) 


We  are  now  ready  to  present  our  main 
thaoram  which  reduces  the  problem  of  finding  a 
least-favorable  pair  (end  hence  a  robust  trans¬ 
fer  function)  to  maximising  tha  functional 
at  (dag, day)  over  J  x  ?!.  This  la  accomplished  by 
converting  the  mini max  problem  (2.4)  to  a  maxi- 
min  problem  whose  "min"  pert  (given  by  (2.3)) 
has  already  baen  solved  by  Snydare  [3]. 

Theorem.  If  J  and  ?!  are  nonincartacclng  convex 
waak’-campect  subsets  of  3  then  (mjr.mjj)  €  J*  T[ 

is  e  least- favorable  pair  for  cauaal  linear 
estimation  If  and  only  if 


•D(d*SldV 


(2.7) 


♦ 

where  a^dmj.dm^)  la  dafinad  above  via  (2.3). 


Proof.  If  "only  If"  part  of  tha  theorem  follows 
directly  from  the  definitions  of  least-favors* 
blllcy  end  e^(djeg,dahf) ;  l.e.,  tor 

•il  (,s;V  €  J  *  n‘  *  *o (l**s ’ ^*51  ’ 

h£)  a  e*  (da^  ,da^>.  In  feet  this  is  so  for 

arbitrary  subsets,  J  end  *!,  of  3.  on  the  other 
hand,  tha  "if”  part  does  depend  on  Che  hypothesis. 


V^S'dV^  ‘  Vd*S’d-H;Hl>  «*S> 

for  all  (mg.ay)  5  j/x?t,  where  h£  la  tha  optimal 

cauaal  transfer  function  for  (mi,  mi)  defined  via 
(2.3). 

It  la  straightforward  co  see  chat  a  pair 

(mg.my)  i  J*7[  is  laaat  favorable  If  end  only  If 
It  end  ice  optimal  transfer  function  form  a 
saddUpolnt  solution  to  the  game  (2.4),  l.e. 


By  definition  a  pair  (ag,ay)  la  Isaac 
favorable  for  cauaal  linear  ateisucion  for  classes 
J  and  7!  of  spectre  If  a.  (dm,,  dm,;  a:)  *  a  (dm“, 

d^;<)  for  all  (^.n/s/xVvhar.  11,  the 
optimal  cauaal  transfer  function  for  (m^  ,m[J) 

defined  by  (2.3).  we  wish  to  show  that  this  is 
true  for  a  pair  (m",a“)  €  J  x  ?!  if  J  and  "!  are 
weak*  corset  and  if  (2.7)  holds,  i.a.  If 

*D(daS,d"S’aL)  *  *D(<taS .i»y;Ht).  for  all  (mg.my)  € 

J  x  ?!,  whara, for  sach  pair  (mg.m,),  H  is  Che 


causal  transfer  function  which  ia  optimal  for 
chat  pair. 


« -mixture,  total-variation,  and  Prokhorov  nodal a 
saa  [6)  and  [7];  for  tha  hand  and  p-point  models 
saa  [8)).  Thus  thaaa  raaulta  ara  quits  general. 


.'iota  that  if  mini max  • quality  holds  for  tha 
gape  (2.4)  (l.a.  if  (2.6)  holds)  than  tha  pair 
(m“,a£)  satisfying  (2.7)  ia  claarly  Isaac  favor- 

abls  for  J  and  *7  (saa  Equation  (2.6)).  Thus  our 
problem  raducaa  to  showing  that  aininax  equality 
holds.  Va  will  make  use  of  tha  following  (from 
Theorem  1.5,  pp.  141-144  in  [11]). 

Logs.  If  X  and  X  ara  separable  topological 
linear  spaces;  A  and  B  ara  compact  convex  sub¬ 
sets  of  X  and  X,  respectively ;  and  t  ia  a  real 
valuad  upper -lower -aaaic one inuoua  coocava-con- 
vex  function  on  A  x  S  then  F  satisfies  the 
ainiaax  equality  on  A  x  B. 

We  wish  to  apply  this  iaaaa  with  X  •  3x3 
(recall  that  3  is  tha  space  of  finite  Bora l 
measures  on  endowed  with  its  weak*  topo¬ 

logy,  X  la  endowed  with  the  corresponding  pro¬ 
duct  topology);  X  *  Hx(d8)  (tha  Hardy  subspace 
of  L2(dB)  spanned  by  '.a^Sn  *  0,1,2,...]  hare 
endowed  with  its  weak  topology);  A  •  J  x  7!;  and 
3  *  Our  function  F  on  A  X  B  ia  given  by 

FUBj.a^.H)  -  e^dag.dnjjjH). 

Tha  only  proparty  which  ia  not  obvious  (or 
sc  Isaac  straightforward  to  show)  ia  chat  K*  ia 
compact.  From  [10],  Sections  V.l  and  V.4,  wa 
saa  that  all  we  need  to  show  is  that  X*  ia  , 
bounded  end  cloeed  in  the  norm  topology  of  HJ(dB). 

Boundedness  is  straightforward  since  ;ln|j  • 

>H®)i"dB  s  2r.  For  cloeednaee  we  mist  show 

•  rr  ft 

chet  if  H  -  H  In  norm  (i.e.  if  lim  ]  H  (9 )  - 
n  __  _  *  _  n 

n-»  -r  . 

HO  )|  <#  •  0)  and  if  =  X  then  S  (  l, 

Assume  if  i  Chan  there  ia  a  Borel  set  ECU 
such  chac  (E)  >  0  (where  X  la  Labe ague -Bor el 
measure  on  U)  and  H<9 )  >  l  *  a  for  soma  a  >  0 
«  , 

and  for  all  9  i  E.  But  |  H  (9 )  -  H(B  )|‘dB  * 

»  2  2.  n 

_  ‘  Ha 0 ) -H(9)|  dB  >  a*\(t)  >  0  for  all  n.  Con- 
5  t 

eradictlonl  Banco  X  la  co^act  ia  cbo  woak 
topology  of  «£(<• )  and  wa  may  apply  the  Lemma. 
This  completes  the  proof  of  tha  chaeram. 


Abstractly,  tha  significance  of  robust 
signal  as time cion  la  clear:  to  be  ah  la  to  put 
tha  tightest  upper  bound  on  tha  error  whoa  tho 
possibility  of  deviation  from  tha  assumed  spectra 
axiacs  la  claarly  desirable.  However  in  moat 
situations  wa  mat  also  expect  chac  tha  robust 
estimator  will  not  perform  as  wall  as  tha  assumed 
(or  nominal)  estimator  if  tha  true  spectra  ara 
tha  nominal  spectra.  So  thoro  la  a  trade-off. 
Thus  tha  questions  that  naturally  arise  ara  how 
much  la  gained  by  the  robust  estimator  la  its 
worst  case  (at  (nj.njp)  as  compared  to  tha 

nominal  estimator  at  Its  worst  case  and  how  much 
is  lost  in  using  tha  robust  estimator  should  tha 
true  sptcera  be  the  nominal  ones.  Claarly  a 
blanket  statement  of  tha  superiority  of  one 
tat last or  over  tho  other  in  ail  casoa  la  not 
likely  to  prove  correct.  Thus  wa  undart  aka,  in 
the  next  sect ion.  an  attempt  to  answer  thaaa 
questions  with  soma  numerical  ax allies. 

3.  AM  EXAMPLE 


In  this  aaccion  wa  consider  robust  filter¬ 
ing  of  a  contaminated  wide-tense  Markov  signal 
white  noise.  Snyders  [3]  has  shown  that,  if 
rapresants  white  noisa  (l.a.  d*i}(9  )  ■  2r 
for  soma  positive  constant  Kg)  and' if  0(9) 
rapraaanta  filtering  (l.a.  0(9 )  a  l),  than 

^  *  iogd  +  s*1-^  <$))• 


2rN0<* 


*D(d*S’<%>  "  V1  * 


2tt  * 


(3.1) 


for  any  signal  tpecerta  me.  Wa  wish  to  consider 
the  case  whan  7;  •  ]mS]  and  J  •  fm£5jm(A)  • 
(l-t)a£(A)  *■  m'  A  e  S-,  for  tons  nonnags- 

Civs  a*  €  3  with  m1  (TJ)  *  a?(U)  •  2rrv*}  where  m? 
rapraaanta  a  nominal  signal  apoccrum,  hare  glvin 
by  dmjjfl)  •  (v* (l-r^)/ (l-2r  eoeS  * r2) ]dB  for 
r  6  [-1,1].  and  a  S  [0,1]  rapraaanta  tha  degree 
of  possible  contamination  or  error  in  assuming 
that  a ia  the  true  signal  apoccrum. 

W 

Tha  particular  mg  wa  have  ehoaan  is  tha 
spectrum  of  a  wide -tanas  Markov  process.  It  has 
power  v2  and,  for  r  a  3  -2vT  •  .172,  it  has  3dB 
power  bandwidth 


Tha  hypothesis  of  tho  theoran  (that  J  and 
"  bo  convex  waak*-coapact  aubaata  of  8)  la  oot 
overly  restrictive  for  our  purpooea.  Moat  Im¬ 
portantly,  a  capacity  clan  (l.a.  a  class  at 
tha  form  at 3|  m(A)  s  v(a)  for  all  Borel  aubaata, 
A,  of  U,  m(U)  •  v(U)}  for  a  capacity  v)  ia  veak*- 
coapact  (!«],  Lamas  2  .2)  and  convex.  A  capacity 
la  a  sat  function  which  generalises  tha  Idas  of 
a  aaaaura  and  was  introduced  by  Choquat  [ 12]  and 
applied  by  Buber  and  Strsaaan  [(]  to  nodal  un¬ 
certainty  in  robust  hypothesis  casting.  All  tha 
aoat  coanoniy  used  "robustness  neighborhoods" 
have  boon  shown  to  bo  capacity  classes  (for  tha 


eoe^Ur2 -4r  +  l)/(-Zr)]  .  (3.2) 

Substituting  the  expression  for  a$  into  (3.1)  wa 
obtain  o^(dn^,da^|) .  Alternatively,  since 

datjVdB  ia  rational,  we  can  datamiaa  that  tha 
optlmal^cauaal  transfer  function  for  the  noninal 
pair  (a^.mj)  ia  glvan  by  (soa  [13],  Obapcar  7) 

«!<*)  * - ^TTT  *  *  0  (3.3) 

1  ~  a* 

where  a  •  (b  -  A^-4r^)/2r,  X»  avJ  (l-rJ)/ (Mpd-ra)) 


and  b  ■  (v2 rl-r2)/M. v (l+r2) ] .  Ue  can  chan  sub¬ 
stitute  <3. 3),  m"  and  mj2  into  (2.3)  to  obtain 
*  M  tf  s  3 

«-(dm‘..,daM).  Further,  chi*  procedure  may  be 

J  3  ,j  * 

used  to  obtain  a^ctau.da^.iU)  for  any  signal 
spacenM  05;  in  particular,  it  nay  ba  ua«d  Co 
ur  u  *  u  * 

obtain  aQ  (di*s  .d^jHg)  -  sat  eD  (d*s , da^, ; Hg ) . 

*5  ”  J 

It  ia  easy  to  lee  chat  *^c  (VC  stand*  tor  worst 
easel  1*  given  by  (l-«)«j  +■  smj  where  aj(A)  • 

2-v*  iff  0  €  A,  since  [  l  •  H(0)l  2  ■  supl  1  -  H (9  ) |  2. 

So  we  heve  error  expression*  for  Hq  at  the 
nominal  pair  sad  in  its  worst  case.  If  there  is 
s  significant  difference  between  these  errors 
WC  u  ♦ 

(l.e.  if  £g(da$  .dSjjiHg)  is  such  larger  than 
*D(da*,da^))  for  reasonable  values  of  *  (say 

<  •  .1)  then  a  clear  need  for  robust  filtering 
exists,  the  (alniaax)  robust  filter  H«  designed 
in  Section  2,  will  satisfy  this  need  it 

Sjj(d»“,da^J)  is  relatively  close  to  *0(dsij,dsJ) . 

Later  in  this  section  we  will  show  chat  this  is 
often  Che  case.  First  we  oust  find  tha  Least* 
favorable  signal  spectrum,  at,  so  chat  we  aay 

use  (3.1)  to  calculate  eD(das,daJj). 

Because  of  the  power  constraint  (i.e. , 
m(U)  »  v2  f  m  i  **)  we  aey  use  Che  results  of 
section  til  of  [21  (particularly  Lsseis  l)  and 
of  [14]  to  say  that 


2-vZq-cm-r2) 
l  *  2r  coed  r2 


2*tv  (l-«)c 


dm" 

ll  -3^(9)>2rtc 

d-M  ( 

if  *  2ttc 


vh*r«  c  is  s  potltlv*  conjccnc  which  can  always 

*T 

be  determined  so  that  '  dm^  (9  )  »  2ttv2  .  We  may 

now  substitute  (3.4)  into  (3.1)  to  calculate 
*  L  W 

e[J(dm.,dmJ() ,  che  worst  case  error  for  tha  robust 
filter  Og. 

For  comparison  purpose*  the  three  error 

*  m  y 

expression*  discussed  above  (e.  (da^.dm^) , 
*0(<h^C.dm^;H0)  and  e^dm^.da^) )  were  nntaallaed 

by  eh*  signal  power  r2  and  viewed  ea  fractions 
of  the  input  slgnel-to-noise  ratio  (SMB)  for 
each  value  of  the  uncertainty  parameter  «  and 
eh*  bandwidth  parameter  r.  (Bacall  that  r 
completely  determines  the  signal  bandwidth  via 
(3.2).)  Thee*  three  normallaed  error  quantities 
were  evaluated  for  a  variety  of  values  of  tha 
parameters  e  end  r  and  far  a  large  range  of  in- 
pur  SMB  (which  for  convenience  was  converted  to 
dg). 


Cartful  examination  of  FIG.  I  (hare  the  3dB 
power  bandwidth  le  .001,  c  •  .1  and  c  •  .121) 
shows  that  for  SNR's  between  20  dB  and  40  dB  the 
worst  css*  performance  of  ch*  nominal  filtar 

WC  w  * 

(aQ(das  .dm^iHjj))  La  an  ordar  of  magnitude  or 

more  greater  than  the  wortt  cate  performance  of 

*  r  y 

tha  robust  filtar  (a^dm^.dm^)).  On  tha  othar 

hand  at  20  dg  this  worst  case  performence  of  the 
robust  filter  It  lets  then  helf  an  order  of  mag¬ 
nitude  greeter  then  chet  of  che  nominal  fllcar 

at  tha  nominal  pair  (e^dm^.dm^),  the  ''optimal'' 

performance  for  the  original  (nonrobuac)  problem). 
Further  chla  difference  deereaeee  rapidly,  to 
chat  from  40  dg  on  chare  is  essentially  no  dif- 
faranca  between  tha  worst  patfotmanca  of  tha 
robuat  filtar  and  tha  performance  of  che  nominal 
filtar  whan  ch*  signal  spectrum  la  at  originally 
assumed;  1.*.,  above  40 dB  we  loaa  nothing  by 
using  the  robust  filter  end  can  at  tha  same  time 
gain  e  consider able  improvement  in  worst  case 
performance. 

Uhlla  the  situation  pictured  in  FIG.  I  is 
typical  of  moat  cases  we  encountered,  FIG.  2 
(hare  tha  bandwidth  It  .103,  t  •  .1  and  c  *  .232) 
is  representative  of  some  other  ceeae.  The 
differences  sung  tha  three  error  quantities  ere 
small  throughout  che  pictured  range  (end  in  feet 
they  are  smaller  elsewhere).  Here  there  la  little 
need  for  robust  filtering  end  of  course  the  ro¬ 
bust  filtar  accomplishes  little.  It  should  bo 
notod,  however,  that  Alia  the  differences  are 
email,  we  see  Chet  for  input  SMB  close  to  0  dB 
the  worst  ceaa  of  the  robust  filter  is  ea  bad  as 
chat  of  cho' nominal  fllcar.  However,  we  have 
found  throughout  our  Investigation  chat  the  only 
situations  in  which  tha  robuat  filcar  t*  Likely 
to  bo  disadvantageous  art  situations  (such  as  in 
FIG.  2)  when  all  tha  error*  of  both  filters  art 
too  largo  to  be  practical. 

4.  CONCLUSIONS 

In  this  paper  we  have  shown  that  (alniaax) 
robuat  causal  sMochere,  filters,  and  pradlccors 
can  ba  sought  by  maximizing  tha  functional 

*0(dm,,dau)  if  tha  spactrsl  class**  s'  and  T?  are 
convex  ana  satisfy  a  topological  condition  which 
all  coamonly  used  models  of  uncertainty  are 
known  to  satisfy.  Further,  for  che  case  of  e 
cent eat  noted  wlde-eense  Markov  signal  in  white 
noise  we  have  demons crated,  for  a  wide  variety 
of  input  SMB's,  chat  tha  trade-off  of  possible 
laisaMd  nominal  case  performance  In  return  for 
che  improved  worst -case  performence  of  tho  robust 
filter  le  a  very  good  coo.  Moreover,  chls 
particular  esse  le  e  very  reasonable  model  for 
assy  application*. 

Uhlla  tha  numerical  result*  of  Soctlon  3 
valldac*  the  analytical  results  of  Section  2  for 
robust  filtering,  chnee  analytical  reeulti  ere 
much  broader.  Thu* ,  it  would  ba  interesting  to 
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consider  a  numerical  treatment  of  both  the  ro¬ 
bust  smoothing  and  prediction  problems,  analyti¬ 
cal.  solutions  to  which  aay  ba  obtained  via  tha 
nachods  of  Saction  2. 
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F1C.  1:  Nominal  filter  at  worst  case  (solid 
line),  robust  filter  at  worst  case 
'dotted  line),  and  nominal  filter 
at  nominal  case  (dashed  line)  when 
bandwidth  •  .901  and  (  •  .1. 


Nominal  filter  at  worst  case  (solid 
line),  robuae  filter  at  worse  esae 
(dotted  line),  and  nominal  filter 
at  nominal  case  (dashed  line)  when 
bandwidth  •  .103  and  *  •  .1. 
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